(57) Abstract

The present invention relates to a corin polypeptide, such as human and mouse corin, which contains
one or more frizzled, LDLR, and SRCR repeats and a serine protease catalytic domain, and which
possesses various activities, including, e.g., serine protease activity and a pro-ANF converting
activity. The invention further relates to methods of using corin nucleic acids and polypeptides in
therapeutics, diagnostics, and research. For example, the nucleic acids and polypeptides of corin can
be utilized in methods to identify modulators of its activity, in animal models to mimic human
disease, and to treat kidney, cardiovascular, and other conditions in which corin is involved.

CORIN, A SERINE PROTEASE

This application is a continuation-in-part of U.S. Serial No. 09/092,029, filed
June 5, 1998, which is hereby incorporated by reference in its entirety.

BACKGROUND OF THE INVENTION 

Serine proteases participate in a variety of developmental and physiological
processes (Stroud R, Sci. Am. 231:74-88, 1974; Neurath H, Science 224:350-357,
1984). For instance, serine proteases are involved in cell signaling, cell differentiation,
and the conversion of pro-hormones to biologically-active forms. See, e.g., Hong and
Hashimoto, Cell, 82:785-794, 1995; Inagami, J. Biol. Chem., 264:3043-3046, 1989.

DESCRIPTION OF THE INVENTION 

The present invention relates to nucleic acids, polypeptides, and fragments
thereof, of a novel gene, e.g., a serine protease, especially a mammalian serine
protease, such as human and mouse corin, which contains one or more frizzled, LDLR,
scavenger receptor cysteine-rich repeats, and serine protease catalytic domains. The
invention further relates to methods of using such nucleic acids and polypeptides in
therapeutics, diagnostics, and research. For example, the nucleic acids and
polypeptides of corin can be utilized in methods to identify modulators of its activity,
in animal models to mimic human disease, and in the diagnosis of pathological
conditions.

BRIEF DESCRIPTION OF THE DRAWINGS

Seq. 1 shows a nucleotide sequence for human corin. 

Seq. 2 shows a deduced amino acid sequence of human corin. 

Seq. 3 shows a nucleotide sequence for mouse corin. 

Seq. 4 shows a deduced amino acid sequence for mouse corin.

Fig. 1 is a schematic illustrating the arrangement of functional domains within
a human corin polypeptide.

Fig. 2 shows an amino acid alignment of frizzled ("Corin Crdl" and "Corin
Crdl") domains from a human corin polypeptide with Frizzled, Fz-1, and lin-17.

Fig. 3 shows an amino acid sequence alignment of LDLR-repeats identified
within a human corin polypeptide with a consensus sequence for human LDLR.

Fig. 4 shows an amino acid sequence alignment of a human corin serine
protease domain ("Corin") with serine protease domains present in three other proteins.
KAL is kallikrein. ENTK is enterokinase. TRP1 is trypsin.



DETAILED DESCRIPTION OF THE INVENTION 

Nucleic acid and polypeptide sequences have been identified which code for
corin, a novel gene comprising a transmembrane/signal peptide, frizzled domains, low
density lipoprotein receptor repeats (LDLR), scavenger receptor cysteine-rich repeats
(SRCR), and a serine protease catalytic domain. See, e.g., Fig. 1.

In accordance with the present invention, a corin polypeptide has an
immunogenic activity which is specific for corin; or, an amino acid sequence which is
obtainable from a naturally-occurring source and which has one or more of the
following activities or domains: a serine protease catalytic activity; a serine protease
catalytic activity domain; a serine protease substrate binding activity; a pro-atrial
natriuretic factor (ANF) converting enzyme catalytic activity; a frizzled domain; an
LDLR repeat domain; and scavenger receptor cysteine-rich repeats (SRCR).

Preferably, the polypeptide fragments coded for by the following nucleic acid
fragments from the Unigene database [PubEST] are excluded: Hs.62794 (AA126468
[1686098], AA126648 [1686206], AA625395 [2537780], AA046682 [1524579],
AA249850 [1881137], and AA046793 [1524691]), and Hs.71798 (AA147031
[1716421]). (The bracketed numbers refer to the PubEST database.) Optionally, the
following fragments can also be excluded: Hs.121626 (AA771958), Hs.1657
(M69297), and Hs.47712 (AA203291), g1231787; g1312726; g1337948; and g942724.
The nucleotide sequences of the aforementioned nucleic acids can be identified by
searching publicly available databases. However, polypeptides which contain or
comprise these sequences are not excluded, e.g., full-length corin, a polypeptide having
two or more of these mentioned fragments, or a polypeptide having one of the
mentioned fragments and additional amino acid sequences, either from corin or from
another source.

By the term "immunogenic activity specific for corin," it is meant that the corin
polypeptide elicits an immunological response which is selective for corin. Such
response can be cellular or humoral. Thus, the stimulation of antibodies, T-cells,
macrophages, B-cells, dendritic cells, etc., by a corin amino acid sequence selected
from a mammalian corin polypeptide, e.g., human or mouse corin as shown in Seqs. 2
and 4, is a specific immunogenic activity. These responses can be measured routinely.
See also the discussion below on antibodies specific for corin.

Serine protease catalytic activity means, e.g., that the corin polypeptide
possesses a polypeptide cleavage activity, preferably at a peptide bond, e.g., where a
serine residue of corin participates in the cleavage of the peptide bond. See, e.g.,
Stroud, Sci. Am., 231:74-88, 1974; Kraut, Ann. Rev. Biochem., 46:331-358, 1977. The
entire corin polypeptide sequence shown in Seq. 2 or Seq. 4, or a part of it, can possess
serine protease catalytic activity. Cleavage after amino acid position 801 of Seq. 2
(splitting the peptide bond between arginine and isoleucine) can increase or enhance
such catalytic activity, resulting in the release of a catalytic fragment comprising, or
consisting essentially of, amino acid positions 802-1042.
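
The coordinates of the two products of such a cleavage follow directly from the cleavage position. The following is a minimal sketch in Python, assuming 1-based residue numbering as used in Seq. 2. The function names and the placeholder sequence are hypothetical and serve only as an illustration; the placeholder is not the sequence of Seq. 2.

```python
def split_after(sequence: str, cleavage_after: int):
    """Split a protein sequence after the given 1-based residue position."""
    if not 0 < cleavage_after < len(sequence):
        raise ValueError("cleavage position must lie inside the sequence")
    return sequence[:cleavage_after], sequence[cleavage_after:]


def fragment_coordinates(total_length: int, cleavage_after: int):
    """Return 1-based inclusive (start, end) pairs for both cleavage products."""
    return (1, cleavage_after), (cleavage_after + 1, total_length)


# Placeholder only (assumption): 1042 residues with Arg at 801 and Ile at 802.
# This is NOT the sequence of Seq. 2.
placeholder = "A" * 800 + "R" + "I" + "G" * 240

pro_region, catalytic_fragment = split_after(placeholder, 801)
coords = fragment_coordinates(len(placeholder), 801)
print(coords, pro_region[-1], catalytic_fragment[0], len(catalytic_fragment))
```

With the real sequence substituted for the placeholder, the second fragment would correspond to positions 802-1042, i.e., 241 residues.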

A "serine protease catalytic activity domain" refers to a region of a polypeptide
which is capable of serine protease catalytic activity, as described above, but which
does not necessarily possess complete, if any, activity in the background in which it is
present. For instance, members of the trypsin family are often initially expressed as a
pro-enzyme which exhibits complete activity only upon cleavage at a specific site
within the polypeptide. Cleavage results in release of a proteolytic fragment.
Generally, the "released fragment" is held in place by disulfide bonds with the
remaining portion of the protein. See, Seq. 2, e.g., for cysteines which could form the
disulfide bonds. A serine protease catalytic domain refers to the proteolytic fragment
prior to its release from the complete corin sequence.

A "pro-atrial natriuretic factor (ANF) converting enzyme catalytic activity"
means, e.g., a catalytic activity in which pro-ANF is converted by proteolytic cleavage
into one or more smaller peptides. See, e.g., Inagami, J. Biol. Chem., 264:3043-3046,
1989; Rosenzweig and Seidman, Ann. Rev. Biochem., 60:229-255, 1991; Wilkins et
al., Lancet, 349:1307-1310, 1997. ANF (also known as ANP) is a cardiac hormone
that, e.g., regulates body fluid homeostasis, blood pressure, plasma volume, and other
physiological processes involved in heart and kidney function. Other substrates for the
converting activity can also include proteins homologous to ANF, such
as BNP and CNP. See, e.g., Wilkins et al., Lancet, 349:1307-1310, 1997; Levin et al.,
New Eng. J. Med., 339:321-328, 1998. The converting activity can be measured
conventionally, e.g., as described in the examples where cleavage of pro-ANF is
assayed by detecting a difference in molecular weight of pro-ANF before and after
treatment with corin, or with a biologically-active fragment of corin. Since corin can
be expressed on the cell-surface, intact cells can be used to process substrates, such as
pro-ANF.

Substrate binding is generally considered the first step in enzyme catalysis
because the substrate, acting as a ligand, must first attach to the enzyme surface to
enable the enzyme to carry out its catalytic reactions. This enzyme surface can be
referred to as the active site of the enzyme. Binding of the substrate to the enzyme
surface can involve multiple interactions with the enzyme, e.g., chemical bonding with
one or more amino acids and/or functional groups which comprise the enzyme. A
serine protease substrate binding activity as used herein means that a substrate, e.g.,
(H-D-Pro-Phe-Arg-pNA.2HCl), S2444 (pyroGlu-Gly-Arg-pNA.HCl), and S2288
(H-D-Ile-Pro-Arg-pNA.2HCl), respectively, or pro-ANF, attaches specifically to a surface
of a corin polypeptide. Attachment to the enzyme can be accomplished by one or more
of the interactions which hold its naturally-occurring substrate to it; however, a
polypeptide can have a substrate binding activity when it holds the substrate with less
than the naturally-occurring number and quality of interactions. A serine protease
substrate binding activity can optionally be effective: to achieve catalysis of the
substrate, to competitively or noncompetitively bind to the active site, to irreversibly
attach to the enzyme, to result in the loss of catalytic activity (e.g., where it is a suicide
substrate), etc.

Serine protease substrate binding activity can be measured conventionally. For
instance, a competition binding assay can be employed to identify substrates which
attach to a polypeptide, or derivative thereof, e.g., by combining under effective
conditions, a substrate containing a detectable marker, a human corin polypeptide, or
fragments thereof, and a compound which is to be tested for substrate binding activity.
The assay can be accomplished in liquid phase, where bound and free substrate are
separated by a membrane, or it can be accomplished in solid phase, as desired.
Solid-phase assays can be performed using high-throughput procedures, e.g., on chips,
wafers, etc. Substrate binding and catalytic activity can be dissociated from each other.
Thus, a corin polypeptide can possess substrate binding activity but not a catalytic
activity.

A corin polypeptide can also comprise a "frizzled-like cysteine rich domain" (or
"frizzled domain", as used herein) having Wnt binding activity. A frizzled domain can
possess a ligand binding activity or region, e.g., as defined and described in Zorn,
Current Biology, 7:R501-R504, 1997. For example, a frizzled domain can act as a
receptor for ligands having homology to Wnts and other secreted glycoproteins
involved in cell proliferation, cell signaling, etc. It can be membrane bound or soluble.
In a soluble form, it can also act as an antagonist, e.g., when the free frizzled domain of
corin binds the free ligand and prevents it from interacting with its cognate receptor on
the cell surface.

An LDLR repeat contains a human LDLR consensus sequence as shown in Fig. 3
and can optionally have ligand binding activity.

A corin polypeptide can also comprise a scavenger receptor cysteine-rich repeat
domain ("SRCR"). See, e.g., Matsumoto et al., Proc. Natl. Acad. Sci., 87:9133-9137,
1990.

A mammalian corin is a mammalian polypeptide having an amino acid
sequence which is obtainable from a natural source and the mentioned activities. It can
be full-length (i.e., as shown in Seq. 2) or it can be less than full-length and possess
one or more of the mentioned activities. It therefore includes naturally-occurring
normal, mutant, polymorphic, etc., sequences. Natural sources include, e.g., living
cells, e.g., obtained from tissues or whole organisms, cultured cell lines, including
primary and immortalized cell lines, biopsied tissues, etc.

The present invention also relates to fragments of a full-length mammalian
corin, such as human or mouse corin. The fragments are preferably "biologically
active". By "biologically active", it is meant that the polypeptide fragment possesses
an activity in a living system or with components of a living system. Biological
activities include those mentioned, e.g., a catalytic activity, a substrate binding activity,
and/or an immunogenic activity. Fragments can be prepared according to any desired
method, including chemical synthesis, genetic engineering, cleavage products, etc.
See below.

The present invention also relates to a human corin having a deduced sequence
of amino acids 1 to 1042 as shown in Seq. 2. The 1042 amino acid
polypeptide has a predicted molecular weight of about 116 kilodaltons. It comprises
the following domains: a hydrophobic region at about amino acid positions 46-66;
frizzled cysteine-rich domains at about amino acid positions 134-259 and 450-573;
seven LDLR repeats at about amino acid positions 268-415 and 579-690; a
cysteine-rich region at about amino acid positions 713-801 homologous to a macrophage
scavenger receptor motif; and a serine protease catalytic domain at about amino acid
positions 802-1042. There are nineteen putative N-glycosylation sites present in the
extracellular domain which is at about amino acid positions 67-1042.
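
The domain boundaries listed above can be held in a small lookup table for bookkeeping purposes. The following is a minimal sketch in Python, assuming the approximate 1-based positions given for human corin. The domain labels, the grouping of the seven LDLR repeats into two "clusters", and the function names are illustrative assumptions and are not terms defined by the specification.

```python
# Approximate boundaries from the text (1-based, inclusive); labels are illustrative.
HUMAN_CORIN_DOMAINS = [
    ("hydrophobic region", 46, 66),
    ("frizzled cysteine-rich domain 1", 134, 259),
    ("LDLR repeat cluster 1", 268, 415),
    ("frizzled cysteine-rich domain 2", 450, 573),
    ("LDLR repeat cluster 2", 579, 690),
    ("scavenger receptor-like cysteine-rich region", 713, 801),
    ("serine protease catalytic domain", 802, 1042),
]


def domain_length(start: int, end: int) -> int:
    """Number of residues in an inclusive 1-based interval."""
    return end - start + 1


def domains_at(position: int, domains=HUMAN_CORIN_DOMAINS):
    """Names of all listed domains that contain the given residue position."""
    return [name for name, start, end in domains if start <= position <= end]


def has_overlaps(domains=HUMAN_CORIN_DOMAINS) -> bool:
    """True if any two listed intervals share at least one residue."""
    ordered = sorted(domains, key=lambda d: d[1])
    return any(prev[2] >= nxt[1] for prev, nxt in zip(ordered, ordered[1:]))


for name, start, end in HUMAN_CORIN_DOMAINS:
    print(f"{name}: {start}-{end} ({domain_length(start, end)} residues)")
```

Such a table makes it straightforward to ask which region a given residue falls in, e.g., `domains_at(843)` for a residue within the protease domain.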

A corin polypeptide of the invention, e.g., having an amino acid sequence as
shown in Seq. 2 or Seq. 4, can be analyzed by available methods to identify other
structural and/or functional domains in the polypeptide. For example, a corin
polypeptide can be analyzed by methods disclosed in, e.g., Kyte and Doolittle, J. Mol.
Biol., 157:105, 1982; EMBL Protein Predict; Rost and Sander, Proteins, 19:55-72,
1994.
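
As an illustration of one such analysis, the following is a minimal Python sketch of a sliding-window hydropathy profile using the Kyte and Doolittle scale. The window of 19 residues and the threshold of 1.6 are a commonly used rule of thumb for flagging candidate membrane-spanning segments and are treated here as assumptions; the function names and the toy sequence are hypothetical, and the toy sequence is not taken from Seq. 2 or Seq. 4.

```python
# Kyte-Doolittle hydropathy values per one-letter amino acid code.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence: str, window: int = 19):
    """Mean hydropathy of every full window along the sequence."""
    scores = [KD[aa] for aa in sequence]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


def hydrophobic_window_starts(sequence: str, window: int = 19, threshold: float = 1.6):
    """1-based start positions of windows whose mean hydropathy meets the threshold."""
    profile = hydropathy_profile(sequence, window)
    return [i + 1 for i, score in enumerate(profile) if score >= threshold]


# Toy sequence (assumption): charged flanks around a 21-residue hydrophobic stretch.
toy = "DEKRDEKRDE" + "LIV" * 7 + "DEKRDEKRDE"
print(hydrophobic_window_starts(toy))
```

Applied to a real corin sequence, windows meeting the threshold would be expected to cluster around the hydrophobic region described at about positions 46-66, although this sketch does not verify that.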

The hydrophobic domain at about amino acids 46-66 of a human corin can
serve as a membrane anchoring sequence. See, also, hepsin, a mammalian serine
protease of the trypsin family which contains a transmembrane domain near its
amino-terminus (Kurachi et al., Methods in Enzymology, 244:100-114, 1994);
Stubble-stubbloid, a Drosophila melanogaster serine protease which also contains a
transmembrane domain (Appel et al., Proc. Natl. Acad. Sci. USA, 90:4937-4941, 1993).
There are positively charged amino acid residues immediately preceding the
hydrophobic and putative transmembrane domain, indicating that corin may be a type
II transmembrane protein with the amino terminus in the cytosol.

Corin also contains at least two cysteine-rich frizzled domains at about amino
acid positions 134-259 and 450-573. Fig. 2 shows a comparison of the human corin
frizzled domains with FZ-1, lin-17, and Frizzled. The frizzled domain comprises, e.g.,
a ligand binding site. See, e.g., Zorn, Current Biology, 7:R501-R504, 1997; Leyns et
al., Cell, 88:747-756, 1997. As described below, an aspect of the invention is to
identify ligands which bind to the corin frizzled domains, and which optionally
regulate the activity of cells expressing the corin polypeptide or cells expressing the
ligand.

Analysis of the corin protein sequence showed that in the extracellular region
there are two frizzled-like cysteine-rich domains, seven LDL receptor repeats, one
macrophage scavenger receptor-like domain, and one trypsin-like serine protease
domain (2 A). Two frizzled-like cysteine-rich domains are located at amino acids
134-259 and 450-573, respectively. Amino acid sequences of these two domains share
significant similarities with the extracellular cysteine-rich domain of the Drosophila
Frizzled protein, a seven transmembrane receptor essential for polarity determination
during the development of the fruit fly (19). The frizzled-like cysteine-rich domains
have also been found in other proteins, such as Dfz2 in Drosophila (20), Lin-17 in C.
elegans (21) and FZ-1 in human (22). The sequences of the two frizzled-like
cysteine-rich domains in corin are closest to those in Lin-17 and FZ-1. All ten
conserved cysteine residues are present in the frizzled-like cysteine-rich domains of
corin.

Between amino acids 268-415 and 579-690, there are seven cysteine-rich
repeats homologous to the LDL receptor class A repeats (23). Each repeat is about 36
amino acids long and contains six cysteine residues as well as a highly conserved
cluster of negatively charged amino acids. In the LDL receptor, these cysteine-rich
repeats bind calcium ions and play an essential role in endocytosis of the extracellular
ligands (23). Similar motifs have been found in the extracellular domain of other
membrane receptors, such as LDL receptor-related protein (LRP1) (24), megalin (also
known as LRP2 or gp330) (25), complement proteins (26), enterokinase (27), and
Drosophila proteins yolkless and nudel (28,29).

In addition to the frizzled-like cysteine-rich domains and LDL receptor-like
repeats, there is another cysteine-rich region between amino acids 713 and 801 in
corin. This region contains 88 amino acids and is homologous to the cysteine-rich
motif found in the macrophage scavenger receptor (30). This motif is also present in
the sea urchin spermatozoa speract receptor (31,32) and the vertebrate serine protease,
enterokinase (27).

At the carboxyl terminus of corin protein between amino acid residues 802 and
1042, there is a trypsin-like serine protease domain. This protease domain is highly
homologous to the catalytic domain of members of the trypsin superfamily. For
example, amino acid sequence identities between corin and prekallikrein (33), factor
XI (34) and hepsin (35) are 40%, 40% and 38%, respectively. All essential features of
serine protease sequences are well conserved in corin. The active site residues of the
catalytic triad are located at His843, Asp892 and Ser985. The amino acid residues
forming the substrate specificity pocket are located at Asp979, Gly1007 and Gly1018.
These residues are predicted to bind the substrate P1 residues, suggesting that corin
would cleave its substrate after basic residues, such as lysine or arginine. In addition, a
putative activation cleavage site was found at Arg801, suggesting that corin would be
synthesized as an inactive zymogen and that another trypsin-like enzyme was required
for its activation.
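
Whether a given sequence carries the expected residues at these positions can be checked mechanically. The following is a minimal Python sketch, assuming 1-based numbering as in Seq. 2 and one-letter amino acid codes. The dictionary names, the function name, and the placeholder sequence are hypothetical; the placeholder is constructed only so that the check has something to run against and is not the sequence of Seq. 2.

```python
# Positions and residues as stated in the text (1-based, one-letter codes).
CATALYTIC_TRIAD = {843: "H", 892: "D", 985: "S"}
SPECIFICITY_POCKET = {979: "D", 1007: "G", 1018: "G"}
ACTIVATION_SITE = {801: "R"}


def residues_match(sequence: str, expected: dict) -> dict:
    """Map each 1-based position to True if the sequence has the expected residue."""
    return {
        pos: pos <= len(sequence) and sequence[pos - 1] == residue
        for pos, residue in expected.items()
    }


# Placeholder (assumption): poly-alanine with the named residues written in.
residues = ["A"] * 1042
for table in (CATALYTIC_TRIAD, SPECIFICITY_POCKET, ACTIVATION_SITE):
    for pos, residue in table.items():
        residues[pos - 1] = residue
placeholder = "".join(residues)

print(residues_match(placeholder, CATALYTIC_TRIAD))
```

The same function can be pointed at a variant or homolog (after alignment to the reference numbering) to flag substitutions at the active site.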

In the protease domain, there are 12 cysteine residues. Potential pairing of
these cysteine residues can be predicted by comparing with other well-studied serine
proteases, such as trypsin and chymotrypsin. The first three pairs of cysteine residues
present in essentially all members of the trypsin superfamily are located at
Cys828-Cys844, Cys955-Cys970 and Cys981-Cys1010. Two more pairs of cysteine
residues are present at the positions Cys790-Cys912 and Cys926-Cys991. These two
pairs of cysteine residues are commonly found in a subfamily of two-chain serine
proteases, such as chymotrypsin and prekallikrein (33). The presence of Cys790 and
Cys912 indicated that, after the activation cleavage at Arg801, the catalytic domain of
corin would remain attached to the rest of the molecule by a disulfide bond. Interestingly,
there is one additional pair of cysteine residues, Cys817 and Cys830, present in corin.
Cysteine residues at these two positions were not found in any other serine proteases in
vertebrates. A search of databases showed that a chymotrypsinogen-like serine
protease from the lugworm, Arenicola marina, had two cysteine residues at the
corresponding positions (36). A model of the corin protease domain was built based
on the structure of bovine chymotrypsinogen A. Based on this corin model, where the
Cα atoms of these two cysteine residues were held fixed during energy minimization,
the distance between the sulfur atoms of their side chains is about 2.5 Å after rotamer
searching. The model indicates that these two cysteines are likely to form a disulfide
bond connecting two β-sheets in the core of the protease domain.
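
The sulfur-sulfur distance referred to above is an ordinary Euclidean distance between two atomic coordinates. The following is a minimal Python sketch of that calculation. The coordinates are invented for illustration and do not come from the corin model, and the 3.0 Å cutoff is a hypothetical screening value chosen for this sketch; a formed disulfide bond typically has a sulfur-sulfur distance of roughly 2.0-2.1 Å.

```python
import math


def distance(a, b) -> float:
    """Euclidean distance between two (x, y, z) coordinates, in the input units."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def within_disulfide_range(sg_a, sg_b, cutoff: float = 3.0) -> bool:
    """True if two sulfur atoms lie within the (assumed) screening cutoff, in angstroms."""
    return distance(sg_a, sg_b) <= cutoff


# Invented coordinates (assumption), in angstroms; NOT taken from the corin model.
sg_first = (0.0, 0.0, 0.0)
sg_second = (1.5, 2.0, 0.0)

print(distance(sg_first, sg_second), within_disulfide_range(sg_first, sg_second))
```

A pair that passes such a screen is only a candidate; as in the text, the conclusion that a bond is likely still rests on the structural model itself.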

In addition to the human corin sequence, a corin from another mammalian
species, mouse, has been cloned and identified. A full-length nucleotide and amino
acid sequence of mouse corin is shown in Seqs. 3 and 4. There are 1113 amino acids in
the mouse corin and it comprises the same domains as human corin. There is 89%
amino acid sequence identity between mouse and human corin.

Other corin homologs from mammalian and non-mammalian sources can be
obtained according to various methods. For example, hybridization with an
oligonucleotide selective for a mammalian corin can be employed to select such
homologs, e.g., as described in Sambrook et al., Molecular Cloning, 1989, Chapter 11.
Such homologs can have varying amounts of nucleotide and amino acid sequence
identity and similarity to corin. Non-mammalian organisms include, e.g., vertebrates,
invertebrates, zebra fish, chicken, Drosophila, C. elegans, Xenopus, S. pombe, S.
cerevisiae, roundworms, prokaryotes, plants, Arabidopsis, viruses, etc.

The invention also relates to corin-specific amino acid sequences, e.g., a
defined amino acid sequence which is found in the particular human or mouse
sequences of Seqs. 2 and 4, but not in other amino acid sequences from non-corin
polypeptides. A specific amino acid sequence can be found routinely, e.g., by
searching a gene/protein database using the BLAST set of computer programs. A
corin-specific amino acid sequence can be useful to produce peptides as antigens to generate
an immune response specific for it. Antibodies obtained by such immunization can be
used as a specific probe for a mammalian corin protein for diagnostic or research
purposes.

As mentioned, polypeptides of the present invention can comprise a complete
coding sequence for a corin, or fragments thereof. Useful fragments include, e.g.,
fragments comprising, or consisting essentially of, the frizzled domain, one or more
LDLR domains, SRCR repeats, the transmembrane domain, the serine catalytic domain
or fragment. A preferred fragment of human corin comprises the polypeptide sequence
of Seq. 2, with the proviso that the fragment is not a polypeptide coded for by
Hs.62794 (AA126468 [1686098], AA126648 [1686206], AA625395 [2537780],
AA046682 [1524579], AA249850 [1881137], and AA046793 [1524691]), and
Hs.71798 (AA147031 [1716421]). See above.

A fragment of a corin polypeptide can be selected to have a specific biological
activity, e.g., a serine protease activity, a pro-ANF converting enzyme activity, a
substrate binding activity, an immunogenic activity, etc. A useful fragment can be
identified routinely by testing such fragments for a desired activity. The measurement
of these activities is described below and in the examples. These peptides can also be
identified and prepared as described in EP 496 162.

A polypeptide of the present invention can also have 100% or less amino acid
sequence identity to the amino acid sequence set forth in Seq. 2 or 4. For the purposes
of the following discussion: Sequence identity means that the same nucleotide or
amino acid which is found in the sequence set forth in Seqs. 1-4 is found at the
corresponding position of the compared sequence(s). A polypeptide having less than
100% sequence identity to the amino acid sequence set forth in Seq. 2 or Seq. 4 can
contain various substitutions from the naturally-occurring sequence, including
homologous and non-homologous amino acid substitutions. See below for examples
of homologous amino acid substitution. The sum of the identical and homologous
residues divided by the total number of residues in the sequence over which the corin
polypeptide is compared is equal to the percent sequence similarity. For purposes of
calculating sequence identity and similarity, the compared sequences can be aligned
and calculated according to any desired method, algorithm, computer program, etc.,
including, e.g., FASTA, BLAST. A polypeptide having less than 100% amino acid
sequence identity to the amino acid sequence of Seq. 2 or Seq. 4 can comprise, e.g.,
about 99%, 98%, 97%, 95%, 90%, 70%, etc. A preferred amount of amino acid
sequence identity is about 85% or more, e.g., about 86%.
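
The identity and similarity definitions given above can be illustrated on a pair of pre-aligned sequences. The following is a minimal Python sketch under several stated assumptions: the two inputs are already aligned to equal length with "-" marking gaps; the denominator is taken as the number of reference residues compared (columns where the reference is not a gap); and the groups of "homologous" residues are a generic illustrative grouping chosen for this sketch, not the substitution groups defined elsewhere in the specification. The function name and the toy sequences are hypothetical.

```python
# Illustrative groups of mutually "homologous" residues (assumption).
HOMOLOGOUS_GROUPS = [
    set("ILVM"), set("FYW"), set("KRH"), set("DE"),
    set("NQ"), set("ST"), set("AG"),
]


def is_homologous(a: str, b: str) -> bool:
    """True if two different residues fall in the same illustrative group."""
    return a != b and any(a in group and b in group for group in HOMOLOGOUS_GROUPS)


def identity_and_similarity(reference: str, compared: str):
    """Percent identity and percent similarity over the aligned reference residues."""
    if len(reference) != len(compared):
        raise ValueError("sequences must be pre-aligned to equal length")
    total = identical = homologous = 0
    for ref_res, cmp_res in zip(reference, compared):
        if ref_res == "-":
            continue  # column not counted: no reference residue to compare
        total += 1
        if cmp_res == ref_res:
            identical += 1
        elif cmp_res != "-" and is_homologous(ref_res, cmp_res):
            homologous += 1
    if total == 0:
        raise ValueError("no reference residues to compare")
    return 100.0 * identical / total, 100.0 * (identical + homologous) / total


# Toy aligned pair (assumption), not corin sequence.
identity, similarity = identity_and_similarity("MKKSPLLV", "MKRSPIL-")
print(identity, similarity)
```

In this toy pair, five of eight reference positions are identical and two more are homologous under the assumed grouping, so similarity exceeds identity; a different grouping or gap convention would change the numbers.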

A mammalian corin polypeptide, fragment, or substituted polypeptide can also
comprise various modifications, where such modifications include lipid modification,
methylation, phosphorylation, glycosylation, covalent modifications (e.g., of an
R-group of an amino acid), amino acid substitution, amino acid deletion, or amino acid
addition. Modifications to the polypeptide can be accomplished according to various
methods, including recombinant, synthetic, chemical, etc.






WO 99/64608 PCT/EP99/03895 


Polypeptides of the present invention (e.g., human corin or mouse corin,
fragments thereof, mutations thereof) can be used in various ways, e.g., in assays, as
immunogens for antibodies as described below, as biologically-active agents (e.g.,
having one or more of the activities associated with corin).

A polypeptide coding for a corin, a derivative thereof, or a fragment thereof,
can be combined with one or more structural domains, functional domains, detectable
domains, antigenic domains, and/or a desired polypeptide of interest, in an
arrangement which does not occur in nature, i.e., not naturally-occurring, e.g., as in a
human or murine corin gene, a genomic fragment prepared from the genome of a living
organism, e.g., an animal, preferably a mammal, such as human, mouse, or cell lines
thereof. A polypeptide comprising such features is a chimeric or fusion polypeptide.
Such a chimeric polypeptide can be prepared according to various methods, including
chemical, synthetic, quasi-synthetic, and/or recombinant methods. A chimeric nucleic
acid coding for a chimeric polypeptide can contain the various domains or desired
polypeptides in a continuous (e.g., with multiple N-terminal domains to stabilize or
enhance activity) or interrupted open reading frame, e.g., containing introns, splice
sites, enhancers, etc. The chimeric nucleic acid can be produced according to various
methods. See, e.g., U.S. Pat. No. 5,439,819. A domain or desired polypeptide can
possess any desired property, including a biological function such as catalytic,
signalling, growth promoting, cellular targeting (e.g., signal sequence, targeting
sequence, such as to endosomes, lysosomes, ER, nucleus), etc., a structural function
such as hydrophobic, hydrophilic, membrane-spanning, etc., receptor-ligand functions,
and/or detectable functions, e.g., combined with enzyme, fluorescent polypeptide,
green fluorescent protein (Chalfie et al., 1994, Science, 263:802; Cheng et al., 1996,
Nature Biotechnology, 14:606; Levy et al., 1996, Nature Biotechnology, 14:610), etc.
In addition, a polypeptide, or a part of it, can be used as a selectable marker when
introduced into a host cell. For example, a nucleic acid coding for an amino acid
sequence according to the present invention can be fused in frame to a desired coding
sequence and act as a tag for purification, selection, or marking purposes. The region
of fusion can encode a cleavage site to facilitate expression, isolation, purification, etc.

A polypeptide according to the present invention can be produced in an
expression system, e.g., in vivo, in vitro, cell-free, recombinant, cell fusion, etc.,
according to the present invention. Modifications to the polypeptide imparted by such
systems include glycosylation, amino acid substitution (e.g., by differing codon usage),
polypeptide processing such as digestion, cleavage, endopeptidase or exopeptidase
activity, attachment of chemical moieties, including lipids and phosphates, etc.

A polypeptide according to the present invention can be recovered from natural
sources, transformed host cells (culture medium or cells) according to the usual
methods, including detergent extraction (e.g., Triton X-100, CHAPS, octylglucoside),
ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange
chromatography, phosphocellulose chromatography, hydrophobic interaction
chromatography, hydroxyapatite chromatography and lectin chromatography. Protein
refolding steps can be used, as necessary, in completing the configuration of the mature
protein. Finally, high performance liquid chromatography (HPLC) can be employed
for purification steps. A corin polypeptide can also be isolated as described for other
serine proteases, e.g., Wu et al., J. Biol. Chem., 267:24408-24412, 1992; Wu et al.,
Proc. Natl. Acad. Sci., 88:6775-6779, 1991.

A mammalian corin nucleic acid, or fragment thereof, is a nucleic acid having a
nucleotide sequence obtainable from a natural source. See above. It therefore
includes naturally-occurring, normal, mutant, polymorphic alleles, degenerate
sequences, etc. Natural sources include, e.g., living cells obtained from tissues and
whole organisms, cultured cell lines, including primary and immortalized cell lines.
Preferably, the following nucleic acid fragments from the Unigene database [PubEST]
are excluded: Hs.62794 (AA126468 [1686098], AA126648 [1686206], AA625395
[2537780], AA046682 [1524579], AA249850 [1881137], and AA046793 [1524691]),
and Hs.71798 (AA147031 [1716421]). (The bracketed numbers refer to the PubEST
database.) Optionally, the following fragments can also be excluded: Hs.121626
(AA771958), Hs.1657 (M69297), and Hs.47712 (AA203291), g1231787; g1312726;
g1337948; and g942724. The nucleotide sequences of the aforementioned nucleic
acids can be identified by searching publicly available databases. However, nucleic
acids which contain or comprise these sequences are not excluded, e.g., full-length
corin, or a nucleic acid having two or more of these mentioned fragments. It is also
preferred that the mentioned fragments in a cloning vector, such as a plasmid or phage,
are excluded.

Human corin is expressed as an mRNA of about 5 kb. It is most abundant in
heart. It is either absent, or expressed in very low levels in brain, placenta, lung, liver,
skeletal muscle, kidney, pancreas, spleen, thymus, prostate, testis, ovary, small
intestine, colon, bladder, uterus, and stomach. Consequently, corin can be used as a
marker for the presence of heart tissue, e.g., in tissue sections (using corin-specific
antibodies or corin-specific nucleic acid probes), in biopsied samples, etc. Corin was
also detected in various human cell lines, including uterus tumor, osteosarcoma,
endometrium carcinoma lines HEC-1-A, AN3 CA, and RL95-2, leiomyosarcoma
SK-LM-1, and osteosarcoma U2-OS. Further details are described in the examples below.

A nucleic acid sequence of a human corin allele having 4933 base pairs is
shown in Seq. 1. The size of the DNA is consistent with the length of corin mRNA
(~5 kb) detected by Northern analysis. An ATG codon is located at position 95 that may
represent the translation initiation site. The open reading frame (ORF) spans 3126 bp
with a 5' untranslated region (UTR) of 94 nucleotides before the initiation codon. At
the 3' end, there is a 1.7-kb 3' UTR after the stop codon at position 3221. A
polyadenylation signal of AATAAA is present 12 nucleotides before the
poly(A)+ tail. A nucleic acid sequence of the invention can contain the complete coding
sequence from amino acid 1 to amino acid 1042, degenerate sequences thereof, and
fragments thereof. A nucleic acid according to the present invention can also comprise
a nucleotide sequence which is 100% complementary, e.g., an anti-sense, to any
nucleotide sequence mentioned above and below.
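
By way of non-limiting illustration, the coordinates quoted above for the human
cDNA can be checked for internal consistency with a short calculation. The sketch
below uses only the numbers stated in the text; the function and variable names are
hypothetical, and it assumes that position 3221 is the first base of a three-base stop
codon.

```python
# Coordinates quoted in the text (1-based positions in Seq. 1).
CDNA_LENGTH = 4933    # total length of the human corin cDNA, in bp
ATG_POSITION = 95     # first base of the putative initiation codon
ORF_LENGTH = 3126     # length of the open reading frame, in bp
STOP_POSITION = 3221  # assumed to be the first base of the stop codon


def orf_summary(cdna_length, atg_position, orf_length, stop_position):
    """Derive UTR sizes and protein length from the quoted coordinates."""
    last_coding_base = atg_position + orf_length - 1
    return {
        "five_prime_utr": atg_position - 1,
        "last_coding_base": last_coding_base,
        "codons": orf_length // 3,
        "in_frame": orf_length % 3 == 0,
        # True when the stop codon begins immediately after the ORF.
        "stop_follows_orf": stop_position == last_coding_base + 1,
        # Everything after the three-base stop codon.
        "three_prime_utr": cdna_length - (stop_position + 2),
    }


summary = orf_summary(CDNA_LENGTH, ATG_POSITION, ORF_LENGTH, STOP_POSITION)
for key, value in summary.items():
    print(f"{key}: {value}")
```

Under these assumptions the 5' UTR, the number of codons, and the approximate size of
the 3' UTR agree with the values given in the text (94 nucleotides, 1042 amino acids,
and about 1.7 kb, respectively).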

The present invention also relates to a mouse nucleotide sequence coding for all
or part of a corin, e.g., as shown in Seq. 3. As is the case for the human allele, the
invention relates to degenerate sequences thereof, and anti-sense fragments thereof.
Northern analysis shows a prominent transcript at about 5 kb in samples derived from
the heart. In contrast to Northern analysis with human samples, low levels of mRNA
were detected in samples obtained from mouse testes and kidneys. By in situ
hybridization, mouse corin mRNA was also detected in embryonic heart tissue,
atrium and ventricular myocytes, developing kidneys, and cartilage-derived structures.
Further details are described in the examples.

A nucleic acid according to the present invention can be obtained from a
variety of different sources. It can be obtained from DNA or RNA, such as
polyadenylated mRNA, e.g., isolated from tissues, cells, or whole organisms. The
nucleic acid can be obtained directly from DNA or RNA, or from a cDNA library. The
nucleic acid can be obtained from a cell at a particular stage of development, having a
desired genotype, phenotype (e.g., an embryonic or adult heart cell or tissue), etc.

As described for corin polypeptides mentioned above, a nucleic acid
comprising a nucleotide sequence coding for a polypeptide according to the present
invention can include only coding sequence; a coding sequence and additional coding
sequence (e.g., sequences coding for leader, secretory, targeting, enzymatic, fluorescent
or other diagnostic peptides); or coding sequences and non-coding sequences, e.g.,
untranslated sequences at either a 5' or 3' end, or dispersed in the coding sequence, e.g.,
introns. A nucleic acid comprising a nucleotide sequence coding without interruption
for a polypeptide means that the nucleotide sequence contains an amino acid coding
sequence for a corin, with no non-coding nucleotides interrupting or intervening in the
coding sequence, e.g., absent intron(s). Such a nucleotide sequence can also be
described as contiguous. A genomic DNA coding for a mammalian corin, e.g., human
or mouse corin, can be obtained routinely.

A nucleic acid according to the present invention also can comprise an
expression control sequence operably linked to a nucleic acid as described above. The
phrase "expression control sequence" means a nucleic acid sequence which regulates
expression of a polypeptide coded for by a nucleic acid to which it is operably linked.
Expression can be regulated at the level of the mRNA or polypeptide. Thus, the
expression control sequence includes mRNA-related elements and protein-related
elements. Such elements include promoters, enhancers (viral or cellular), ribosome
binding sequences, transcriptional terminators, etc. An expression control sequence is
operably linked to a nucleotide coding sequence when the expression control sequence
is positioned in such a manner to effect or achieve expression of the coding sequence.
For example, when a promoter is operably linked 5' to a coding sequence, expression
of the coding sequence is driven by the promoter. Expression control sequences can be
heterologous or endogenous to the normal gene.

A nucleic acid in accordance with the present invention can be selected on the
basis of nucleic acid hybridization. The ability of two single-stranded nucleic acid
preparations to hybridize together is a measure of their nucleotide sequence
complementarity, e.g., base-pairing between nucleotides, such as A-T, G-C, etc. The
invention thus also relates to nucleic acids which hybridize to a nucleic acid
comprising a nucleotide sequence as set forth in Seq. 1 and Seq. 3. A nucleotide
sequence hybridizing to the latter sequence will have a complementary nucleic acid
strand, or act as a template for one in the presence of a polymerase (i.e., an appropriate
nucleic acid synthesizing enzyme). The present invention includes both strands of
nucleic acid, e.g., a sense strand and an anti-sense strand.

Hybridization conditions can be chosen to select nucleic acids which have a
desired amount of nucleotide complementarity with the nucleotide sequence set forth
in Seq. 1 and Seq. 3. A nucleic acid capable of hybridizing to such sequence preferably
possesses, e.g., about 85%, more preferably, 90%, 92%, and even more
preferably, 95%, 97%, or 100% complementarity, between the sequences. The present
invention particularly relates to nucleic acid sequences which hybridize to the
nucleotide sequence set forth in Seq. 1 and Seq. 3 under low or high stringency
conditions.

Nucleic acids which hybridize to corin sequences can be selected in various
ways. For instance, blots (i.e., matrices containing nucleic acid) can be incubated in a
prehybridization solution (6X SSC, 0.5% SDS, 100 µg/ml denatured salmon sperm
DNA, 5X Denhardt's solution, and 50% formamide) at 30°C overnight, and then
hybridized with radiolabeled probes (corin) in a hybridization solution (6X SSC, 0.5%
SDS, 100 µg/ml denatured salmon sperm DNA, and 50% formamide) at 42°C
overnight in accordance with known procedures. Blots can be washed at high
stringency conditions that allow, e.g., for less than 5% bp mismatch (e.g., wash twice
in 0.1% SSC and 0.1% SDS for 30 min at 65°C), i.e., 95% or greater sequence
identity. Whereas high stringency washes can allow for less than 5% mismatch,
relaxed or low stringency wash conditions (e.g., wash twice in 0.2% SSC and 0.5%
SDS for 30 min at 37°C) can permit up to 20% mismatch. Another non-limiting
example of low stringency conditions includes a final wash at 42°C in a buffer
containing 30 mM NaCl and 0.5% SDS. Another non-limiting example of high
stringency conditions includes a final wash at 65°C in aqueous buffer containing 30
mM NaCl and 0.5% SDS. Washing and hybridization can also be performed as
described in Sambrook et al., Molecular Cloning, 1989, Chapter 9. Hybridization can
also be based on a calculation of melting temperature (Tm) of the hybrid formed
between the probe and its target, as described in Sambrook et al. Such stringent
conditions can select sequences which have, e.g., at least about 95%, preferably 97%,
nucleotide complementarity between the nucleic acids, with the proviso that such
nucleic acid is not: Hs.62794 (AA126468 [1686098], AA126648 [1686206],
AA625395 [2537780], AA046682 [1524579], AA249850 [1881137], and AA046793
[1524691]), and Hs.71798 (AA147031 [1716421]). See also above and below.
Nucleic acids which contain or comprise these sequences are not excluded, e.g.,
full-length corin, or a nucleic acid having two or more of these mentioned fragments.
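
By way of non-limiting illustration, a Tm-based choice of wash conditions can be
sketched as follows. The sketch assumes the commonly used empirical relation for long
DNA:DNA hybrids, Tm = 81.5 + 16.6 log10[Na+] + 0.41(%G+C) - 0.63(%formamide) - 600/L,
together with the rule of thumb that each 1% of mismatch lowers Tm by roughly 1°C.
Both are approximations adopted here for illustration and are not taken from the
present description; the function names, the 50% G+C content, and the 500 bp probe
length are likewise hypothetical.

```python
import math


def estimate_tm(na_molar, gc_percent, formamide_percent, length_bp):
    """Estimate the Tm (deg C) of a long DNA:DNA hybrid (empirical formula)."""
    if na_molar <= 0 or length_bp <= 0:
        raise ValueError("salt concentration and length must be positive")
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.63 * formamide_percent
            - 600.0 / length_bp)


def tolerated_mismatch(tm_perfect, wash_temp, drop_per_percent=1.0):
    """Approximate % mismatch still stable at wash_temp.

    Assumes Tm falls by drop_per_percent deg C per 1% mismatch.
    """
    return max(0.0, (tm_perfect - wash_temp) / drop_per_percent)


# Hypothetical probe: 500 bp, 50% G+C, washed in 30 mM NaCl without formamide.
tm = estimate_tm(na_molar=0.030, gc_percent=50.0,
                 formamide_percent=0.0, length_bp=500)
for wash_temp in (42.0, 65.0):
    allowed = tolerated_mismatch(tm, wash_temp)
    print(f"wash at {wash_temp:.0f} C: Tm ~ {tm:.1f} C, "
          f"tolerates ~{allowed:.0f}% mismatch")
```

Raising the wash temperature toward the estimated Tm narrows the tolerated mismatch,
which is the sense in which the 65°C wash is more stringent than the 42°C wash.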

According to the present invention, a nucleic acid or polypeptide can comprise
one or more differences in the nucleotide or amino acid sequence set forth in Seqs. 1-4.
Changes or modifications to the nucleotide and/or amino acid sequence can be
accomplished by any method available, including directed or random mutagenesis.

A nucleic acid coding for a human or mouse corin according to the invention
can comprise nucleotides which occur in a naturally-occurring corin gene, e.g.,
naturally-occurring polymorphisms, normal or mutant alleles (nucleotide or amino
acid), mutations which are discovered in a natural population of mammals, such as
humans, monkeys, pigs, mice, rats, or rabbits. By the term naturally-occurring, it is
meant that the nucleic acid is obtainable from a natural source, e.g., animal tissue and
cells, body fluids, tissue culture cells, forensic samples. Naturally-occurring mutations
can include deletions (e.g., a truncated amino- or carboxy-terminus), substitutions, or
additions of nucleotide sequence. These genes can be detected and isolated by nucleic
acid hybridization according to methods which one skilled in the art would know. A
nucleotide sequence coding for a human or mouse corin polypeptide of the invention
can contain codons found in a naturally-occurring gene, transcript, or cDNA,
e.g., as set forth in Seq. 1 or Seq. 3, or it can contain degenerate codons
coding for the same amino acid sequences. For instance, it may be desirable to change
the codons in the sequence to optimize the sequence for expression in a desired host.
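
By way of non-limiting illustration, the use of degenerate codons can be sketched as
follows. The sketch builds the standard genetic code, replaces codons with synonymous
codons taken from a host preference table, and confirms that the encoded amino acid
sequence is unchanged. The preference table and the short input sequence are
hypothetical placeholders; they are not codon-usage data for any particular host and
are not part of Seq. 1 or Seq. 3.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna):
    """Translate a coding sequence whose length is a multiple of three."""
    if len(dna) % 3:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def recode(dna, preferred):
    """Swap each codon for the preferred synonymous codon, if one is given."""
    out = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        new_codon = preferred.get(amino_acid, codon)
        if CODON_TABLE[new_codon] != amino_acid:
            raise ValueError(f"{new_codon} does not encode {amino_acid}")
        out.append(new_codon)
    return "".join(out)


# Hypothetical host preferences (illustrative only).
PREFERRED = {"L": "CTG", "R": "CGT", "S": "AGC"}

original = "ATGTTAAGATCATAA"  # arbitrary example: Met-Leu-Arg-Ser-stop
recoded = recode(original, PREFERRED)
print(original, translate(original))
print(recoded, translate(recoded))
```

The two nucleotide sequences differ but translate identically, which is what is meant
above by degenerate codons coding for the same amino acid sequence.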

The present invention also relates to corin polypeptide muteins, i.e., any
polypeptide which has an amino acid sequence which differs in amino acid sequence
from an amino acid sequence obtainable from a natural source (a fragment of a
mammal corin does not differ in amino acid sequence from a naturally-occurring
corin). Thus, corin muteins comprise amino acid substitutions, insertions, and
deletions, including non-naturally occurring amino acids. See, e.g., Wu et al., Proc.
Natl. Acad. Sci., 88:6775-6779, 1991, especially reporting amino acid substitutions
where a basic amino acid was substituted with glutamic acid, for guidance on making
mutations.

Muteins to a corin amino acid sequence of the invention can also be prepared
based on homology searching from gene data banks, e.g., Genbank, EMBL. Sequence
homology searching can be accomplished using various methods, including algorithms
described in the BLAST family of computer programs, the Smith-Waterman algorithm,
etc.

A mutein(s) can be introduced into a sequence by identifying and aligning
amino acids within a domain which are identical and/or homologous between
polypeptides and then modifying an amino acid based on such alignment. For
instance, sequence comparisons between human corin and the Frizzled, LDLR, and
serine protease domains of other proteins are illustrated in Figs. 2-4. These alignments
reveal amino acid positions which are identical and also amino acid positions where
the residues differ from each other but are homologous. Homologous amino acids can
be defined based on the size of the side chain and degree of polarization, including,
small nonpolar: cysteine, proline, alanine, threonine; small polar: serine, glycine,
aspartate, asparagine; large polar: glutamate, glutamine, lysine, arginine; intermediate
polarity: tyrosine, histidine, tryptophan; large nonpolar: phenylalanine, methionine,
leucine, isoleucine, valine.

Homologous amino acids can also be grouped as follows: uncharged polar R groups,
glycine, serine, threonine, cysteine, tyrosine, asparagine, glutamine; acidic amino acids
(negatively charged), aspartic acid and glutamic acid; basic amino acids (positively
charged), lysine, arginine, histidine. Homologous amino acids also include those
described by Dayhoff in the Atlas of Protein Sequence and Structure 5 (1978), and by
Argos in EMBO J., 8, 779-785 (1989).
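
By way of non-limiting illustration, the size and polarity grouping set out above can
be written as a lookup table and used to decide whether a proposed substitution is
homologous. The group memberships are taken from the text; the function names are
hypothetical, and the charge-based grouping could be substituted for the table in the
same way.

```python
# Size/polarity groups as listed in the description (three-letter codes).
SIZE_POLARITY_GROUPS = {
    "small nonpolar": {"Cys", "Pro", "Ala", "Thr"},
    "small polar": {"Ser", "Gly", "Asp", "Asn"},
    "large polar": {"Glu", "Gln", "Lys", "Arg"},
    "intermediate polarity": {"Tyr", "His", "Trp"},
    "large nonpolar": {"Phe", "Met", "Leu", "Ile", "Val"},
}


def group_of(residue, groups=SIZE_POLARITY_GROUPS):
    """Return the name of the group containing the residue."""
    for name, members in groups.items():
        if residue in members:
            return name
    raise KeyError(f"unknown residue: {residue}")


def is_homologous(original, replacement, groups=SIZE_POLARITY_GROUPS):
    """True when two different residues fall in the same group."""
    return (original != replacement
            and group_of(original, groups) == group_of(replacement, groups))


for pair in (("Leu", "Ile"), ("Lys", "Arg"), ("Lys", "Asp")):
    verdict = "homologous" if is_homologous(*pair) else "not homologous"
    print(f"{pair[0]} -> {pair[1]}: {verdict}")
```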

Muteins in accordance with the present invention include amino acid sequences
where a residue in the human or mouse corin sequence is replaced by a homologous
residue from a corresponding domain.

Thus, the present invention relates to a corin nucleotide sequence of Seq. 2 or
4, wherein said nucleic acid codes for a polypeptide and one or more amino acid
positions are substituted or deleted, or both, and the polypeptide coded for by the
nucleic acid has a serine protease catalytic domain activity or an immunogenic activity
specific for corin. Such nucleic acid can contain one or more substituted amino acid
positions which are substituted by homologous amino acids.

In addition, the sequence alignments as illustrated in Figs. 2-4 also provide
information on amino acid substitutions that would be expected to reduce, decrease, or
eliminate a biological activity. For instance, where alignment reveals identical amino
acids conserved between two or more domains (e.g., replacing the conserved H, D, or
S residues indicated in Fig. 4), elimination or substitution of the amino acid(s) would
affect its biological activity. Mutations in the catalytic triad, namely, His910, Asp959,
and Ser1052 (mouse corin), abolish the catalytic activity of serine proteases.

A corin polypeptide mutein, and its corresponding nucleotide coding sequence,
can have an amino acid sequence as set forth in Seq. 2 or Seq. 4, except where one or
more positions are substituted by homologous amino acids, e.g., where there are 1, 5,
10, 15, or 20 substitutions. The invention also relates to mutein polypeptides and
mutein nucleic acids coding for such polypeptides.

A nucleic acid according to the present invention can comprise, e.g., DNA,
RNA, synthetic nucleic acid, peptide nucleic acid, modified nucleotides, or mixtures.
A DNA can be double- or single-stranded. Nucleotides comprising a nucleic acid can
be joined via various known linkages, e.g., ester, sulfamate, sulfamide,
phosphorothioate, phosphoramidate, methylphosphonate, carbamate, etc., depending
on the desired purpose, e.g., resistance to nucleases, such as RNase H, improved in
vivo stability, etc. See, e.g., U.S. Pat. No. 5,378,825.

Various modifications can be made to the nucleic acids, such as attaching
detectable markers (avidin, biotin, radioactive elements), moieties which improve
hybridization, detection, or stability. The nucleic acids can also be attached to solid
supports, e.g., nitrocellulose, magnetic or paramagnetic microspheres (e.g., as
described in U.S. Pat. No. 5,411,863; U.S. Pat. No. 5,543,289; for instance, comprising
ferromagnetic, supermagnetic, paramagnetic, superparamagnetic, iron oxide and
polysaccharide), nylon, agarose, diazotized cellulose, latex solid microspheres,
polyacrylamides, etc., according to a desired method. See, e.g., U.S. Pat. Nos.
5,470,967; 5,476,925; 5,478,893.

Another aspect of the present invention relates to oligonucleotides and nucleic
acid probes. Such oligonucleotides or nucleic acid probes can be used, e.g., to detect,
quantitate, or isolate a mammalian corin nucleic acid in a test sample. In a preferred
embodiment, the nucleic acids can be utilized as oligonucleotide probes, e.g., in PCR,
in RACE, differential display, in combination with cDNA libraries, expression
libraries, etc. Useful oligonucleotides are described below in the examples. Detection
can be desirable for a variety of different purposes, including research, diagnostic, and
forensic. For diagnostic purposes, it may be desirable to identify the presence or
quantity of such a nucleic acid sequence in a sample, where the sample is obtained
from tissue, cells, body fluids, etc. In a preferred method, the present invention relates
to a method of detecting a nucleic acid comprising, contacting a target nucleic acid in a
test sample with an oligonucleotide under conditions effective to achieve hybridization
between the target and oligonucleotide; and detecting hybridization. An
oligonucleotide in accordance with the invention can also be used in synthetic nucleic
acid amplification such as PCR (e.g., Saiki et al., 1988, Science, 241:53; U.S. Pat. No.
4,683,202; PCR Protocols: A Guide to Methods and Applications, Innis et al., eds.,
Academic Press, New York, 1990) or differential display (see, e.g., Liang et al., Nucl.
Acid Res., 21:3269-3275, 1993; U.S. Pat. No. 5,599,672; WO97/18454) or RACE.
Such detection can be accomplished in combination with oligonucleotides for other
genes, e.g., genes involved in cardiac tissue or bone development or function.

Another aspect of the present invention is a nucleotide sequence which is
unique to human corin or mouse corin. By a unique sequence to a corin, it is meant a
defined order of nucleotides which occurs in corin, e.g., in the nucleotide sequence of
Seq. 1 or 3, but rarely or infrequently in other nucleic acids, especially not in an animal
nucleic acid, preferably mammal, such as human, rat, mouse, etc. Both sense and
antisense nucleotide sequences are included. A unique nucleic acid according to the
present invention can be determined routinely. A nucleic acid comprising such a
unique sequence can be used as a hybridization probe to identify the presence of, e.g.,
human or mouse corin, in a sample comprising a mixture of nucleic acids, e.g., on a
Northern blot. Hybridization can be performed under stringent conditions (see above)
to select nucleic acids having at least 95% identity (i.e., complementarity) to the probe,
but less stringent conditions can also be used. A unique corin nucleotide sequence can
also be fused in-frame, at either its 5' or 3' end, to various nucleotide sequences as
mentioned throughout the patent, including coding sequences for other parts of corin,
enzymes, GFP, etc., expression control sequences, etc.

Hybridization can be performed under different conditions, depending on the
desired selectivity, e.g., as described in Sambrook et al., Molecular Cloning, 1989. For
example, to specifically detect human or mouse corin, an oligonucleotide can be
hybridized to a target nucleic acid under conditions in which the oligonucleotide only
hybridizes to it, e.g., where the oligonucleotide is 100% complementary to the target.
Different conditions can be used if it is desired to select target nucleic acids which
have less than 100% nucleotide complementarity, e.g., at least about 99%, 97%, 95%,
90%, 70%, 67%.

Oligonucleotides of the present invention can comprise any continuous
nucleotide sequence of Seq. 1 or Seq. 3. These oligonucleotides (nucleic acid)
according to the present invention can be of any desired size, e.g., about 10-200
nucleotides, 12-100, preferably 12-50, 12-25, 14-16, at least about 15, at least about
20, etc. The oligonucleotides can have non-naturally-occurring nucleotides, e.g.,
inosine. The oligonucleotides can have 100% identity or complementarity to a
sequence of Seq. 1 or Seq. 3, or they can have mismatches or nucleotide substitutions,
e.g., 1, 2, 3, 4, or 5 substitutions. In accordance with the present invention, the
oligonucleotide can be provided in a kit, where the kit includes a desired buffer (e.g.,
phosphate, tris, etc.), detection compositions, etc. The oligonucleotide can be labeled
or unlabeled, with radioactive or non-radioactive labels as known in the art.
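
By way of non-limiting illustration, candidate oligonucleotides of a chosen length can
be enumerated from a template, converted to their anti-sense form, and compared with a
target to count substitutions. The template shown is an arbitrary placeholder and is
not taken from Seq. 1 or Seq. 3; the function names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the anti-sense (reverse-complement) strand of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def contiguous_oligos(template, length):
    """List every contiguous oligonucleotide of the given length."""
    return [template[i:i + length]
            for i in range(len(template) - length + 1)]


def mismatches(oligo, target):
    """Count substituted positions between two equal-length sequences."""
    if len(oligo) != len(target):
        raise ValueError("sequences must have the same length")
    return sum(a != b for a, b in zip(oligo, target))


def percent_identity(oligo, target):
    """Percentage of positions at which the two sequences agree."""
    return 100.0 * (len(oligo) - mismatches(oligo, target)) / len(oligo)


template = "ATGGCCAAGTTGACCAGTGCC"  # placeholder 21-mer, not a corin sequence
for oligo in contiguous_oligos(template, 15):
    print(oligo, reverse_complement(oligo))
```

A 20-nucleotide oligonucleotide carrying one substitution has 95% identity to its
target, and one carrying five substitutions has 75% identity, which gives a sense of
how the permitted number of substitutions relates to the complementarity levels
discussed above.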

Anti-sense nucleic acid can also be prepared from a nucleic acid according to
the present invention, preferably an anti-sense to a coding sequence of Seq. 1 or 3.
Antisense nucleic acid can be used in various ways, such as to regulate or modulate
expression of corin, e.g., inhibit it, to detect its expression, or for in situ hybridization.
These oligonucleotides can be used analogously to U.S. Pat. No. 5,576,208. For the
purposes of regulating or modulating expression of corin, an anti-sense oligonucleotide
can be operably linked to an expression control sequence.

The nucleic acid according to the present invention can be labeled according to
any desired method. The nucleic acid can be labeled using radioactive tracers such as
32P, 35S, 125I, 3H, or 14C, to mention some commonly used tracers. The radioactive
labeling can be carried out according to any method such as, for example, terminal
labeling at the 3' or 5' end using a radiolabeled nucleotide, polynucleotide kinase (with
or without dephosphorylation with a phosphatase) or a ligase (depending on the end to
be labeled). A non-radioactive labeling can also be used, combining a nucleic acid of
the present invention with residues having immunological properties (antigens,
haptens), a specific affinity for certain reagents (ligands), properties enabling
detectable enzyme reactions to be completed (enzymes or coenzymes, enzyme
substrates, or other substances involved in an enzymatic reaction), or characteristic
physical properties, such as fluorescence or the emission or absorption of light at a
desired wavelength, etc.

A nucleic acid according to the present invention, including oligonucleotides,
anti-sense nucleic acid, etc., can be used to detect expression of corin in whole organs,
tissues, cells, etc., by various techniques, including Northern blot, PCR, RACE, in situ
hybridization, etc. Such nucleic acids can be particularly useful to detect disturbed
expression, e.g., cell-specific and/or subcellular alterations, of corin. The levels of
corin can be determined alone or in combination with other gene products, especially
cardiac specific gene products.

A nucleic acid according to the present invention can be expressed in a variety
of different systems, in vitro and in vivo, according to the desired purpose. For
example, a nucleic acid can be inserted into an expression vector, introduced into a
desired host, and cultured under conditions effective to achieve expression of a
polypeptide coded for by the nucleic acid. Effective conditions include any culture
conditions which are suitable for achieving production of the polypeptide by the host
cell, including effective temperatures, pH, media, additives to the media in which the
host cell is cultured (e.g., additives which amplify or induce expression such as
butyrate, or methotrexate if the coding nucleic acid is adjacent to a dhfr gene),
cycloheximide, cell densities, culture dishes, etc. A nucleic acid can be introduced into
the cell by any effective method including, e.g., naked DNA, calcium phosphate
precipitation, electroporation, injection, DEAE-Dextran mediated transfection, fusion
with liposomes, association with agents which enhance its uptake into cells, viral
transfection. A cell into which a nucleic acid of the present invention has been
introduced is a transformed host cell. The nucleic acid can be extrachromosomal or
integrated into a chromosome(s) of the host cell. It can be stable or transient. An
expression vector is selected for its compatibility with the host cell. Host cells include
mammalian cells, e.g., COS-7, CV1, BHK, CHO, HeLa, LTK, NIH 3T3; yeast; insect
cells, such as Sf9 (S. frugiperda) and Drosophila; bacteria, such as E. coli,
Streptococcus, bacillus; fungal cells; plant cells; embryonic stem cells (e.g.,
mammalian, such as mouse or human); bone cells (such as osteoclasts or
chondrocytes); cardiac cells (e.g., from a primary culture); muscle cells; neuronal cells;
etc. Expression control sequences are similarly selected for host compatibility and a
desired purpose, e.g., high copy number, high amounts, induction, amplification,
controlled expression. Other sequences which can be employed include enhancers
such as from SV40, CMV, RSV, inducible promoters, cell-type specific elements, or
sequences which allow selective or specific cell expression. Promoters that can be
used to drive its expression include, e.g., the endogenous promoter, MMTV, SV40;
trp, lac, tac, or T7 promoters for bacterial hosts; or alpha factor, alcohol oxidase, or
PGH promoters for yeast.

Another gene of interest can be introduced into the same host for purposes of,
e.g., modulating corin function. Such genes can be the normal gene, or a variation, e.g.,
a mutation, chimera, polymorphism, etc.

A nucleic acid or polypeptide of the present invention can be used as a size
marker in nucleic acid or protein electrophoresis, chromatography, etc. Defined
restriction fragments can be determined by scanning the sequence for restriction sites,
calculating the size, and performing the corresponding restriction digest. A corin
cDNA as shown in Seq. 1 can be used as a 4.8 kb molecular weight marker in nucleic
acid electrophoresis.
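
By way of non-limiting illustration, scanning a sequence for restriction sites and
calculating the resulting fragment sizes can be sketched as follows. The example uses
the EcoRI recognition site GAATTC, which is cut after the first G, and an arbitrary
placeholder sequence that is not taken from Seq. 1; the function name is hypothetical,
and a linear molecule and complete digestion are assumed.

```python
def digest(sequence, site, cut_offset):
    """Return fragment lengths from a complete digest of a linear molecule.

    cut_offset is the number of bases of the recognition site that remain
    on the upstream fragment (1 for EcoRI, which cuts G^AATTC).
    """
    cuts = []
    start = sequence.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = sequence.find(site, start + 1)
    boundaries = [0] + cuts + [len(sequence)]
    return [end - begin for begin, end in zip(boundaries, boundaries[1:])]


placeholder = "AAAAGAATTCCCCCCGAATTCTT"  # arbitrary sequence with two sites
fragments = digest(placeholder, "GAATTC", cut_offset=1)
print(fragments)
```

The fragment lengths sum to the length of the input, so the same routine applied to a
full-length cDNA yields a set of defined marker sizes for a chosen enzyme.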

The human corin cDNA as shown in Seq. 1 maps to human chromosome
position 4p12-13. It can thus be used as a marker in pedigree mapping. For example,
this chromosomal region appears to segregate with total anomalous pulmonary venous
return (TAPVR) and therefore could be used to map the disease in association with
other genetic markers. See, e.g., Bleyl et al., Am. J. Hum. Genet., 56:408-415, 1995.

Another aspect of the present invention relates to the regulation of biological
pathways in which a corin gene, or gene product, is involved, particularly pathological
and developmental conditions, and to the diagnosis of such conditions by detecting
either corin polypeptide or nucleic acid. For example, ANF is involved in a variety of
physiological processes related to the cardiovascular and kidney systems, including,
e.g., body fluid homeostasis, blood pressure, vasodilation, natriuresis, inhibition of
sodium absorption in the glomerular duct, increased glomerular filtration, inhibition of
aldosterone production and secretion, mitogenesis, etc. The regulation of the pro-ANF
converting activity of corin can therefore have profound effects on an organism's
physiology. For instance, levels of ANF are normally low in ventricular
myocytes but increase dramatically with cardiovascular diseases. In patients with
congestive heart failure, plasma levels of ANF are high. The high concentrations are
correlated with the severity of ventricular dysfunction. Similarly, high concentrations
of ANF are associated with cardiac arrhythmias and hemodynamic compromise. See,
e.g., Levin et al., New Engl. J. Med., 339:321-328, 1998. Abnormally high levels can
be reduced by inhibiting the activity of corin, e.g., transcriptionally by inhibiting gene
expression or by administering enzyme inhibitors, which act directly on catalytic
activity.

Since corin is highly expressed in cartilage-derived and cardiac cells, it may be
involved in diseases associated with these tissues, such as familial hypertrophic
cardiomyopathy, osteopetrosis, and osteoporosis-pseudoglioma syndrome,
osteoporosis, Paget's disease, osteitis deformans, and total anomalous pulmonary
venous return. As mentioned, corin oligonucleotides can be utilized in pedigree
mapping to study the familial inheritance of these diseases, in analogy to TAPVR, and
for diagnostic purposes (e.g., prenatal) if corin is associated (gene linkage or causative)
with such a disease. Corin may also be involved in heart development, developmental
pathways which involve signaling with Wnt proteins and other growth factors, cell
differentiation (e.g., chondrocyte differentiation), and related processes. In addition,
corin may be involved in cell-cell signaling, differentiation (bone and/or cardiac), and
other developmental pathways. This role may be mediated by serine protease catalytic
activity, frizzled domains, and/or the LDLR domains. Corin may also be involved in
the processing (e.g., by catalytic cleavage of a progrowth factor) of growth factors,
such as BMP or TGF-β.

Along these lines, a further aspect of the invention relates to the treatment of
cardiovascular and kidney disease, such as hypertension, congestive heart disease, or
renal failure (as well as pathological conditions related to any other pathway in
which corin participates), comprising the administration, to a host in need thereof, of
an effective amount of an agent which modulates the activity of a mammalian corin.
By the term "modulates," it is meant: increasing, agonizing, promoting, stabilizing,
decreasing, reducing, antagonizing, blocking, inhibiting, etc. The nature of the desired
modulatory effect can be determined routinely, e.g., based on whether the pathological
effect is produced by elevated or diminished quantities of ANF. The activity can be inhibited
by administering an agent which directly blocks the catalytic activity of corin (e.g., an
enzyme inhibitor, such as leupeptin or diisopropyl fluorophosphate) or by an agent,
such as antisense, which blocks transcription or translation of the corresponding gene.
Corin activity can be increased, e.g., by administering a corin gene which is effective
to produce corin polypeptide. Corin gene can be administered in analogy to other gene
therapies. See, e.g., Magovern et al., Hum. Gen. Ther., 8:215-227, 1997; Springer et
al., Mol. Cell, 2:549-558, 1998.

The present invention also relates to methods of modulating the activity of
corin in any environment, including in vitro and in vivo, either by directly modulating
its activity, or by modulating expression of the gene which encodes it. Enzyme activity
can be measured conventionally, e.g., as described in Wu et al., J. Biol. Chem.,
267:24408-24412, 1992; Wu et al., Proc. Natl. Acad. Sci., 88:6775-6779, 1991.
Converting activity can be measured as described in the examples, or, e.g., as described
by Inagami, J. Biol. Chem., 264:3043-3046, 1989. The invention thus relates to a
method of identifying modulators of the aforementioned activity of a corin
polypeptide, fragment, or mutein thereof, comprising: reacting, in the presence of a test
compound, a corin polypeptide, or a fragment thereof, and a substrate for serine protease
or the converting enzyme, under conditions effective for said polypeptide or fragment
thereof to cleave said substrate; detecting said cleavage; and identifying whether the
test compound modulates said serine protease activity by comparing the amount of
cleavage in the presence and absence of the test compound.
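
By way of non-limiting illustration, the comparison of cleavage in the presence and
absence of a test compound can be expressed as a percentage change relative to the
untreated control. The function names, the 20% decision threshold, and the numerical
readings are hypothetical and serve only to show the arithmetic.

```python
def percent_change(cleavage_with, cleavage_without):
    """Change in cleavage caused by the compound, as % of the control."""
    if cleavage_without <= 0:
        raise ValueError("control cleavage must be positive")
    return 100.0 * (cleavage_with - cleavage_without) / cleavage_without


def classify(change_percent, threshold=20.0):
    """Label a compound using a hypothetical +/- threshold (in percent)."""
    if change_percent <= -threshold:
        return "inhibitor"
    if change_percent >= threshold:
        return "activator"
    return "no clear effect"


control = 100.0  # hypothetical cleavage signal without test compound
for name, reading in (("compound A", 25.0), ("compound B", 105.0)):
    change = percent_change(reading, control)
    print(f"{name}: {change:+.0f}% -> {classify(change)}")
```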

Any substrate is suitable, and any means for detection, such as chromogenic,
fluorogenic, radioactive, electrophoretic, etc., can be utilized. For example, the
products of cleavage can be detected using labeled substrates in combination with gel
electrophoresis. In a preferred example, the substrate is a chromogenic substrate, i.e., a
peptide that reacts with the serine protease and which is designed to possess a
selectivity similar to that of the natural substrate for the enzyme. Attached to the
peptide part of the chromogenic substrate is a chemical group which, when released
after the enzyme cleavage, gives rise to a detectable color. The color change can be
followed spectrophotometrically and is proportional to the proteolytic activity of the
serine protease catalytic domain. These substrates can be routinely synthesized.
Hydrolysis of substrate (e.g., containing 4-nitroaniline, pNA, as the chromophore) can
be measured at room temperature by following absorbance at 405 nm. Substrates
which can be used to measure amidolytic activity of serine proteases include S2302
(H-D-Pro-Phe-Arg-pNA.2HCl), S2444 (pyroGlu-Gly-Arg-pNA.HCl), and S2288
(H-D-Ile-Pro-Arg-pNA.2HCl). Other chromophores that can be incorporated
into the substrate include, e.g., ANBA, ADMP, thiolester derivatives, etc. See, e.g.,
Chromogenix catalogs and its websites for further information. Values for Km and kcat
can be calculated using the Michaelis-Menten equation.
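
As a non-limiting illustration of the kinetic calculation, the sketch below estimates
Vmax and Km from initial rates and derives kcat as Vmax divided by the total enzyme
concentration. It uses the Hanes-Woolf linearization rather than nonlinear regression,
purely to keep the example dependency-free. The function names, the data, the
constants, and the enzyme concentration are all hypothetical.

```python
def michaelis_menten_rate(substrate_conc, vmax, km):
    """Initial rate predicted by v = Vmax*[S] / (Km + [S])."""
    return vmax * substrate_conc / (km + substrate_conc)


def fit_hanes_woolf(substrate_concs, initial_rates):
    """Estimate (Vmax, Km) from the Hanes-Woolf line [S]/v = [S]/Vmax + Km/Vmax."""
    xs = list(substrate_concs)
    ys = [s / v for s, v in zip(substrate_concs, initial_rates)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Ordinary least-squares slope and intercept of y on x.
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    intercept = mean_y - slope * mean_x
    vmax = 1.0 / slope        # slope = 1/Vmax
    km = intercept * vmax     # intercept = Km/Vmax
    return vmax, km


# Hypothetical, noise-free data generated from assumed constants (not measured values).
ENZYME_UM = 0.01  # assumed total enzyme concentration, micromolar
substrate_um = [25.0, 50.0, 100.0, 200.0, 400.0, 800.0]
rates_um_per_min = [michaelis_menten_rate(s, vmax=2.0, km=150.0) for s in substrate_um]

vmax_fit, km_fit = fit_hanes_woolf(substrate_um, rates_um_per_min)
kcat_fit = vmax_fit / ENZYME_UM  # kcat = Vmax / [E]total, here per minute
print(f"Vmax = {vmax_fit:.3f} uM/min, Km = {km_fit:.1f} uM, kcat = {kcat_fit:.1f} per min")
```

With real absorbance data, the rates would first be converted from absorbance change to
concentration change, and a nonlinear fit would generally weight the errors more evenly.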

The invention also relates to a method of identifying compounds which bind 
specifically to a corin polypeptide, comprising: contacting a corin polypeptide with a 
test compound under conditions effective for said test compound to bind specifically to 
said corin polypeptide; and detecting binding to said corin polypeptide. Binding assays
can be performed conventionally. The corin polypeptide can comprise the complete
coding sequence, or fragments thereof, especially, one or more LDLR domains, 
frizzled domains, SRCR, or combinations thereof. See, above, for amino acid 
positions of such domains. The domains can be utilized as fusion proteins, e.g., in 
combination with immunoglobulin domains. See, e.g., U.S. Pat. No. 5,639,597. 
The corin component can be added to the reaction mixture in a variety of forms,
e.g., substantially purified, as a component of a cell, as a soluble extract, or as a lysate.
In each case, the corin polypeptide can be obtained from a natural source, a
recombinant source (i.e., a "recombinant" polypeptide is one produced by genetic
engineering, e.g., introduced into a cell line on a plasmid, vector, naked DNA, etc., and
expressed in the cell), or it can be produced synthetically (produced chemically or
enzymatically, e.g., cleavage of a full-length corin at amino acid residue 801, e.g.,
using a fragment of 802-1042). The corin polypeptide can be expressed in a
mammalian cell, an insect cell line, or in bacteria, e.g., as a fusion or non-fusion
protein.

Preferably, corin is expressed in a cell line transformed with a corin coding 
sequence (e.g., a cDNA, a gene, a genomic fragment, etc.). In the latter case, the corin
is present as a heterologous component of the cell; by heterologous, it is meant that the
corin is coded for by a coding sequence that has been introduced by the hand of a
person into the cell, e.g., by transfection, transformation, etc. Preferably, the corin is
expressed at high levels in the cell (bacterial, yeast, insect, mammalian, etc.). A
human corin is a preferred coding sequence. See, e.g., Seq. 1 and Seq. 2.

In a preferred aspect of the invention, the corin is provided as a cell lysate, e.g., 
cells transformed with human corin are lysed and the resulting lysate is used directly in 
the assay, i.e., a crude lysate. The crude lysate comprising the recombinant human
corin can optionally be refined or enriched for human corin. For instance, membrane
fractions can be isolated conventionally. 

Corin can also be modulated at earlier steps in its expression pathway, e.g., 
modulating its transcription, mRNA stability, translation, post-translational
modifications, processing (such as cleavage at the internal cleavage site), etc.
Expression can be regulated using different agents, e.g., an antisense nucleic acid, a
ribozyme, an aptamer, a synthetic compound, or a naturally-occurring compound. 

Compounds identified in this or other manners can be useful to modulate corin 
activity in a cell, a tissue, a whole organism, in situ, in vitro (test tube, a solid support,
etc.), in vivo, or in any desired environment. In general, a compound having such an in
vitro activity will be useful in vivo to modulate a biological pathway associated with
corin, e.g., to treat a pathological condition associated with the biological and cellular
activities mentioned above, including bone and cardiac diseases and developmental
disorders thereof, growth factor and protein maturation via serine protease activity, etc. 

To treat a disease, the compound, or mixture, can be formulated into a 
pharmaceutical composition comprising a pharmaceutically acceptable carrier and 
other excipients as apparent to the skilled worker. See, e.g., Remington's 
Pharmaceutical Sciences, Eighteenth Edition, Mack Publishing Company, 1990. Such a
composition can additionally contain effective amounts of other compounds.

The present invention also relates to a method of modulating, preferably 
inhibiting, expression of a gene coding for a mammalian corin, comprising: contacting 
a cell expressing a mammalian corin of the present invention, such as a human or 
mouse corin, with an amount of agent, such as an antisense oligonucleotide or 
antisense RNA of a human or mouse corin gene, which is effective to sequence- 
specifically inhibit said gene. Inhibiting expression of a corin gene can inhibit the 
maturation of ANF, and have consequent impact on the pathways in which ANF is 
involved, including those mentioned above. 

Sequence-specific inhibition of a gene can be accomplished conventionally 
using antisense nucleic acid, such as antisense oligonucleotides or RNA. For example, 
antisense oligonucleotides, such as phosphodiester or phosphorothioate
deoxyoligonucleotides, can be designed to specific regions of a corin RNA, such as to
the translation initiation site, and can then be administered to cells expressing such
genes in quantities effective to inhibit their expression. Generally, an antisense nucleic
acid is a nucleic acid which is complementary to the sense or coding strand of a given
gene, and as a result is also complementary to, and thus able to specifically hybridize
with, mRNA transcripts of the gene.
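
The complementarity relationship described above can be sketched computationally. The
following Python example is illustrative only: it derives an antisense DNA oligonucleotide
as the reverse complement of a chosen stretch of sense-strand sequence. The function names
and the choice of target window are assumptions; the input is the sense primer sequence
quoted in the RT-PCR example, used here merely as a convenient stretch of corin sense
sequence, not as a recommended antisense target.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    seq = dna.upper()
    if set(seq) - set("ACGT"):
        raise ValueError("sequence may contain only A, C, G, and T")
    return seq.translate(COMPLEMENT)[::-1]


def antisense_oligo(sense_strand: str, start: int, length: int) -> str:
    """Antisense oligo (5' to 3') against sense_strand[start:start + length], 0-based."""
    target = sense_strand[start:start + length]
    if len(target) != length:
        raise ValueError("target window extends past the end of the sequence")
    return reverse_complement(target)


# Sense-strand sequence taken from the RT-PCR sense primer; the window is hypothetical.
sense = "AACAAAAGGATCCTTGGAGGTCGGACGAGT"
oligo = antisense_oligo(sense, start=0, length=18)
print(oligo)
```

An oligonucleotide designed this way hybridizes to the mRNA because the mRNA carries the
same sequence as the sense strand (with U in place of T).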

To enhance stability, the administered nucleic acid can be modified, e.g., to 
make it resistant to cellular enzymes, oxidation, reduction, nucleases, etc., or to
enhance its uptake into cells. Any suitable modification can be used, including, e.g.,
phosphorothioates, methylphosphonates, phosphodiester oligonucleotide linked to an
acridine intercalating agent and/or a hydrophobic tail, psoralen derivatives, 2-ribose
modifications, pentose sugar derivatives, nitrogen base derivatives, etc. See, e.g., U.S.
Pat. No. 5,576,208 and U.S. Pat. No. 5,744,362. See, above, for other derivatives,
modifications, etc. which can be useful in the invention. In general, an antisense
nucleic acid of the present invention can comprise monomers of naturally-occurring
nucleotides, non-naturally-occurring nucleotides, and combinations thereof to enhance 
cellular uptake and/or stability. 

Antisense can be administered as naked nucleic acid, complexed or
encapsulated with and by other agents which facilitate its uptake into a cell, injected
into cells, or by any suitable delivery means in association with vectors, such as viral
vectors (e.g., adenoviral). Antisense gene therapy can be accomplished by any suitable
method. See, e.g., Phillips, Am. J. Cardiol., 82:605-625, 1998; Haller et al., Kidney Int.,
53:1550-1558, 1998.

The present invention also relates to antibodies which specifically recognize
corin. An antibody specific for corin means that the antibody recognizes a defined
sequence of amino acids within or including a corin, e.g., the human and murine
sequences of Seq. 2 and Seq. 3. Thus, a specific antibody will generally bind with
higher affinity to an amino acid sequence, i.e., an epitope, found in Seq. 2 than to a
different epitope(s), e.g., as detected and/or measured by an immunoblot assay or other
conventional immunoassay. Thus, an antibody which is specific for an epitope of
human corin is useful to detect the presence of the epitope in a sample, e.g., a sample
of tissue containing human corin gene product, distinguishing it from samples in which
the epitope is absent. Such antibodies are useful as described in Santa Cruz
Biotechnology, Inc., Research Product Catalog, and can be formulated accordingly,
e.g., 100 µg/ml.

Antibodies, e.g., polyclonal, monoclonal, recombinant, chimeric, humanized,
can be prepared according to any desired method. See, also, screening recombinant
immunoglobulin libraries (Orlandi et al., Proc. Natl. Acad. Sci., 86:3833-3837, 1989;
Huse et al., Science, 256:1275-1281, 1989); in vitro stimulation of lymphocyte
populations; Winter and Milstein, Nature, 349:293-299, 1991. For example, for the
production of monoclonal antibodies, a polypeptide according to Fig. 2 or Fig. 3 can be
administered to mice, goats, or rabbits subcutaneously and/or intraperitoneally, with or
without adjuvant, in an amount effective to elicit an immune response. The antibodies
can also be single chain or Fab fragments. The antibodies can be IgG, including subtypes
such as IgG2a, IgG1, etc. Antibodies, and immune responses, can also be generated by
administering naked DNA. See, e.g., U.S. Pat. Nos. 5,703,055; 5,589,466; 5,580,859.
Corin, or fragments thereof, for use in the induction of antibodies do not need
to have biological activity; however, they must have immunogenic activity. Peptides
for use in the induction of corin-specific antibodies may have an amino acid sequence
consisting of at least five amino acids, preferably at least 10 amino acids. Short
stretches of corin amino acids, e.g., five amino acids, can be fused with those of
another protein such as keyhole limpet hemocyanin, or another useful carrier, and the
chimeric molecule used for antibody production.

Several different approaches, as mentioned, can be utilized to prepare 
antibodies specific for corin. For instance, in one approach, denatured corin from
purified corin (e.g., purified by reverse-phase HPLC separation) is obtained in
quantities up to 75 mg. This denatured protein can be used to immunize mice or rabbits
using standard protocols; about 100 micrograms are adequate for immunization of a
mouse, while up to 1 mg might be used to immunize a rabbit. For identifying mouse
hybridomas, the denatured protein can be radioiodinated and used to screen potential
murine B-cell hybridomas for those which produce antibody. This procedure requires
only small quantities of protein, such that 20 mg would be sufficient for labeling and
screening of several thousand clones. 

In another approach, an amino acid sequence of corin, as deduced from the
cDNA, is analyzed to determine regions of high immunogenicity. Polypeptides
comprising these regions are synthesized and used in suitable immunization protocols
to raise antibodies. Analysis to select appropriate epitopes is described by Ausubel FM
et al. (1989, Current Protocols in Molecular Biology, Vol. 2, John Wiley & Sons). The
optimal amino acid sequences for immunization are usually at the C-terminus, the
N-terminus and those intervening, hydrophilic regions of the polypeptide which are likely
to be exposed to the external environment when the protein is in its natural
conformation. Typically, selected peptides, about 15 residues in length, are
synthesized using an Applied Biosystems Peptide Synthesizer Model 431A using
Fmoc chemistry and coupled to keyhole limpet hemocyanin (KLH, Sigma) by reaction
with m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS; cf. Ausubel FM et al.,
supra). If necessary, a cysteine may be introduced at the N-terminus of the peptide to
permit coupling to KLH. Rabbits are immunized with the peptide-KLH complex in
complete Freund's adjuvant. The resulting antisera are tested for antipeptide activity by
binding the peptide to plastic, blocking with 1% BSA, reacting with antisera, washing,
and reacting with labeled (radioactive or fluorescent), affinity purified, specific goat
anti-rabbit IgG.

Antibodies can also be generated against any desired region, or subregion
thereof, e.g., about, or comprising, amino acids 46-66; 134-259; 450-573; 268-415;
579-690; 713-801; 802-1042.

Hybridomas can also be prepared and screened using standard techniques.
Hybridomas of interest are detected by screening with labeled corin to identify those
fusions producing the monoclonal antibody with the desired specificity. In a typical
protocol, wells of plates (FAST, Becton-Dickinson, Palo Alto, Calif.) are coated with
affinity purified, specific rabbit-anti-mouse (or suitable anti-species Ig) antibodies at
10 mg/ml. The coated wells are blocked with 1% BSA, washed and exposed to
supernatants from hybridomas. After incubation the wells are exposed to labeled corin,
1 mg/ml. Clones producing antibodies will bind a quantity of labeled corin which is
detectable above background. Such clones are expanded and subjected to 2 cycles of
cloning at limiting dilution (1 cell/3 wells). Cloned hybridomas are injected into
pristane-treated mice to produce ascites, and monoclonal antibody is purified from mouse
ascitic fluid by affinity chromatography on Protein A. Monoclonal antibodies with
affinities of at least 10^8 M^-1, preferably 10^9 to 10^10 M^-1, or stronger, will typically
be made by standard procedures as described in Harlow and Lane (1988) Antibodies: A
Laboratory Manual, Cold Spring Harbor Laboratory, or Goding (1986) Monoclonal Antibodies:
Principles and Practice, 2nd Ed., Academic Press, N.Y.
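
The rationale for cloning at a limiting dilution of 1 cell per 3 wells can be illustrated
numerically. The sketch below assumes, as is conventional but not stated in this
specification, that cells distribute among wells according to Poisson statistics and that
every seeded cell grows. The function names are hypothetical.

```python
import math


def poisson_pmf(k: int, mean: float) -> float:
    """Probability of exactly k cells in a well when the mean is `mean` cells per well."""
    return math.exp(-mean) * mean ** k / math.factorial(k)


def limiting_dilution_summary(cells_per_well: float) -> dict:
    """Well-occupancy probabilities under the Poisson assumption."""
    p_empty = poisson_pmf(0, cells_per_well)
    p_single = poisson_pmf(1, cells_per_well)
    p_growth = 1.0 - p_empty
    return {
        "empty": p_empty,
        "single": p_single,
        "multiple": p_growth - p_single,
        # Fraction of wells showing growth that started from exactly one cell.
        "clonal_given_growth": p_single / p_growth,
    }


summary = limiting_dilution_summary(1 / 3)
for name, value in summary.items():
    print(f"{name}: {value:.3f}")
```

Under these assumptions most wells showing growth derive from a single cell, and a second
cycle of cloning further reduces the chance that a line is mixed.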

Particular corin antibodies are useful for the diagnosis of prepathologic 
conditions, and chronic or acute diseases which are characterized by differences in the
amount or distribution of corin. Diagnostic tests for corin include methods utilizing
the antibody and a label to detect corin in human (or mouse, etc., if using mouse, etc.)
body fluids, tissues or extracts of such tissues (such as in heart and cartilage tissues). 

The polypeptides and antibodies of the present invention may be used with or 
without modification. Frequently, the polypeptides and antibodies will be labeled by
joining them, either covalently or noncovalently, with a substance which provides for a
detectable signal. A wide variety of labels and conjugation techniques are known and
have been reported extensively in both the scientific and patent literature. Suitable
labels include radionuclides, enzymes, substrates, cofactors, inhibitors, fluorescent
agents, chemiluminescent agents, magnetic particles and the like. Patents teaching the
use of such labels include U.S. Pat. Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345;
4,277,437; 4,275,149; and 4,366,241. Also, recombinant immunoglobulins may be
produced as shown in U.S. Pat. No. 4,816,567, incorporated herein by reference.

A variety of protocols for measuring soluble or membrane-bound corin, using
either polyclonal or monoclonal antibodies specific for corin, are known in the art.
Examples include enzyme-linked immunosorbent assay (ELISA), radioimmunoassay
(RIA) and fluorescence-activated cell sorting (FACS). A two-site monoclonal-based
immunoassay utilizing monoclonal antibodies reactive to two non-interfering epitopes
on corin is preferred, but a competitive binding assay may be employed. These assays are
described, among other places, in Maddox, D.E., et al. (1983) J. Exp. Med. 158:1211.

Antibodies and other ligands which bind corin can be used in various ways, 
including as therapeutic, diagnostic, and commercial research tools, e.g., to quantitate the
levels of corin polypeptide in animals, tissues, cells, etc., to identify the cellular
localization and/or distribution of it, to purify it, or a polypeptide comprising a part of it,
to modulate the function of it, in Western blots, ELISA, immunoprecipitation, RIA, etc.
The present invention relates to such assays, compositions and kits for performing them,
etc. Utilizing these and other methods, an antibody according to the present invention can
be used to detect corin polypeptide or fragments thereof in various samples, including
tissue, cells, body fluid, blood, urine, cerebrospinal fluid. A method of the present
invention comprises: a) contacting a ligand which binds to a peptide of Seq. 2 or Seq. 3
under conditions effective, as known in the art, to achieve binding, and b) detecting
specific binding between the ligand and peptide. By specific binding, it is meant that the
ligand attaches to a defined sequence of amino acids, e.g., within or including the amino
acid sequence of Seq. 2 or Seq. 3 or derivatives thereof. 

Native or recombinant corin can be purified by immunoaffinity chromatography
using corin-specific antibodies. In general, an immunoaffinity column is constructed by
covalently coupling the anti-corin antibody to an activated chromatographic resin. 
Polyclonal immunoglobulins are prepared from immune sera either by
precipitation with ammonium sulfate or by purification on immobilized Protein A
(Pharmacia LKB Biotechnology, Piscataway, N.J.). Likewise, monoclonal antibodies are
prepared from mouse ascites fluid by ammonium sulfate precipitation or chromatography
on immobilized Protein A. Partially purified Ig is covalently attached to a
chromatographic resin such as CNBr-activated Sepharose (Pharmacia LKB Biotechnology,
Piscataway, N.J.). The antibody is coupled to the resin, the resin is blocked, and the
derivative resin is washed according to the manufacturer's instructions.

An immunoaffinity column is utilized in the purification of corin by preparing a
fraction from cells containing corin. This preparation can be derived by solubilization of
the whole cell or of a subcellular fraction obtained via differential centrifugation by the 
addition of detergent or by other methods well known in the art. Alternatively, soluble 
corin containing a signal sequence may be secreted in useful quantity into the medium in 
which the cells are grown. 

A soluble corin-containing preparation is passed over the immunoaffinity column,
and the column is washed under conditions, e.g., high ionic strength buffers in the
presence of detergent, that allow the preferential adsorption of corin. Then, the column
is eluted under conditions that disrupt antibody/corin binding (e.g., a buffer of pH 2-3 or
a high concentration of a chaotrope such as urea or thiocyanate ion), and the corin is
collected.

In addition, ligands which bind to a corin polypeptide according to the present 
invention, or a derivative thereof, can also be prepared, e.g., using synthetic peptide 
libraries or aptamers (e.g., Pirrung et al., U.S. Pat. No. 5,143,854; Geysen et al., 1987, J.
Immunol. Methods, 102:259-274; Scott et al., 1990, Science, 249:386; Blackwell et al.,
1990, Science, 250:1104; Tuerk et al., 1990, Science, 249:505).

The antibodies or derivatives thereof can also be used to inhibit expression of 
corin or a fragment thereof. The levels of corin polypeptide can be determined alone or 
in combination with other gene products. In particular, the amount (e.g., its expression 
level) of corin polypeptide can be compared (e.g., as a ratio) to the amounts of other
polypeptides in the same or different sample, e.g., actin. In general, reagents which are
specific for corin can be used in diagnostic and/or forensic studies according to any
desired method, e.g., as U.S. Pat. Nos. 5,397,712; 5,434,050; 5,429,947. 
The present invention also relates to a corin polypeptide, prepared according to a
desired method, e.g., as disclosed in U.S. Pat. No. 5,434,050. A labeled polypeptide can
be used, e.g., in binding assays, such as to identify substances that bind or attach to corin,
to track the movement of corin in a cell, in an in vitro, in vivo, or in situ system, etc.

A nucleic acid, polypeptide, antibody, corin ligand, etc., according to the present
invention can be isolated. The term "isolated" means that the material is in a form in
which it is not found in its original environment, e.g., more concentrated, more purified,
separated from components, etc. An isolated nucleic acid includes, e.g., a nucleic acid
having the sequence of corin separated from the chromosomal DNA found in a living
animal. This nucleic acid can be part of a vector or inserted into a chromosome (by
specific gene-targeting or by random integration at a position other than its normal
position) and still be isolated in that it is not in a form in which it is found in its natural
environment. A nucleic acid or polypeptide of the present invention can also be
substantially purified. By substantially purified, it is meant that the nucleic acid or
polypeptide is separated and is essentially free from other nucleic acids or polypeptides,
i.e., the nucleic acid or polypeptide is the primary and active constituent.

The present invention also relates to a transgenic animal, e.g., a non-human
mammal, such as a mouse, comprising a corin or a corin knock-out. Transgenic animals
can be prepared according to known methods, including, e.g., by pronuclear injection of
recombinant genes into pronuclei of 1-cell embryos, incorporating a yeast artificial
chromosome into embryonic stem cells, gene targeting methods, and embryonic stem cell
methodology. See, e.g., U.S. Patent Nos. 4,736,866; 4,873,191; 4,873,316; 5,082,779;
5,304,489; 5,174,986; 5,175,384; 5,175,385; 5,221,778; Gordon et al., Proc. Natl. Acad.
Sci., 77:7380-7384 (1980); Palmiter et al., Cell, 41:343-345 (1985); Palmiter et al., Ann.
Rev. Genet., 20:465-499 (1986); Askew et al., Mol. Cell Bio., 13:4115-4124, 1993;
Games et al., Nature, 373:523-527, 1995; Valancius and Smithies, Mol. Cell Bio.,
11:1402-1408, 1991; Stacey et al., Mol. Cell. Bio., 14:1009-1016, 1994; Hasty et al.,
Nature, 350:243-246, 1995; Rubinstein et al., Nucl. Acid Res., 21:2613-2617, 1993. A
nucleic acid according to the present invention can be introduced into any non-human
mammal, including a mouse (Hogan et al., 1986, in Manipulating the Mouse Embryo: A
Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York), pig
(Hammer et al., Nature, 315:343-345, 1985), sheep (Hammer et al., Nature, 315:343-345,
1985), cattle, rat, or primate. See also, e.g., Church, 1987, Trends in Biotech. 5:13-19;
Clark et al., 1987, Trends in Biotech. 5:20-24; and DePamphilis et al., 1988,
BioTechniques, 6:662-680. In addition, e.g., custom transgenic rat and mouse production
is commercially available. These transgenic animals are useful animal models to test for
corin function, as food for a snake, as genetic markers to detect strain origin, etc. Such
transgenic animals can further comprise other transgenes or knock-outs thereof (e.g., in
other serine proteases, natriuretic peptides, etc.).

Transgenic animals comprising multiple copies of the corin gene, the corin gene 
driven by strong promoters, or corin knockouts, can be useful as animal models for
hypertension, renal disease, and cardiac disease. See, e.g., Steinhelper et al.,
Hypertension, 16:301-307, 1990; John et al., Am. J. Physiol., 271:R109-R114, 1996.

The present invention thus relates to a transgenic animal, such as a rodent, a 
mouse, or a rat, comprising cells which contain a recombinant corin gene integrated 
into a chromosome of said cell at the native corin gene locus, said recombinant corin
gene comprising a nucleotide coding sequence which codes for a recombinant corin
polypeptide comprising at least one amino acid whose identity and/or position is not 
naturally-occurring in said native corin gene. Such a recombinant gene can be 
produced by homologous recombination, e.g., between a human corin and the corin 
gene of the host animal at its native locus. 

The present invention also relates to a transgenic animal, such as a mouse or a
rat, comprising cells which contain at least one functionally disrupted recombinant
corin gene at a chromosomal corin gene locus, wherein said disruption prevents
functional expression of the corin polypeptide coded for by the corin gene. Functional
inactivation or disruption refers, e.g., to a partial or complete reduction of the
expression of at least a portion of a polypeptide encoded by an endogenous serine
protease gene of a single cell, selected cells, or all of the cells of a mammal. Such
reduction can result in concomitant reduction in pro-ANF-converting enzyme activity,
causing a decrease in physiologically-active ANF. The term "knockout" is a synonym 
for functional inactivation of the gene. 

In one embodiment, a gene targeting strategy is utilized that facilitates the
introduction of a desired nucleotide sequence into a corin gene. The gene targeting
strategy preferably utilizes double reciprocal recombination and a positive selectable
marker to assist in the insertion of the nucleotide sequence into a target nucleic acid.
The target nucleic acid is preferably a gene, more preferably a gene at its particular
chromosomal locus. The desired nucleotide sequence is inserted into the gene in such
a way that the gene is functionally disrupted, i.e., its expression is partially or
completely reduced.

Modification of the corin gene at its chromosomal locus can be accomplished
according to any suitable method, such as homologous recombination. The latter can
be used to disrupt the corin gene, introduce a gene mutation into the corin gene, insert
or delete DNA, etc. Selection and use of sequences effective for homologous
recombination is described, e.g., in Deng and Capecchi, Mol. Cell Bio., 12:3365-3371,
1992; Bollag et al., Annu. Rev. Genet., 23:199-225, 1989; Waldman and Liskay, Mol.
Cell Bio., 8:5350-5357, 1988; Rubinstein et al., Nucl. Acid Res., 21:2613-2617, 1993;
WO94/23049; WO95/14377.

The corin gene can be completely functionally disrupted, e.g., by deleting 5'
regions of the gene. Alternatively, specific regions of corin can be modified. For
instance, the catalytic domain at about amino acid positions 802-1042 in the human
corin gene, or corresponding positions in the mouse gene, can be genetically altered by
recombination to produce a corin with modified activity, including null activity,
increased activity, or reduced activity.

Generally, the nucleic acids, polypeptides, antibodies, etc. of the present invention
can be prepared and used as described in U.S. Pat. Nos. 5,501,969, 5,506,133, 5,441,870;
WO 90/00607; WO 91/15582.

For other aspects of the nucleic acids, polypeptides, antibodies, etc., reference is
made to standard textbooks of molecular biology, protein science, and immunology. See,
e.g., Davis et al. (1986), Basic Methods in Molecular Biology, Elsevier Sciences
Publishing, Inc., New York; Hames et al. (1985), Nucleic Acid Hybridization, IRL Press;
Molecular Cloning, Sambrook et al.; Current Protocols in Molecular Biology, Edited by
F.M. Ausubel et al., John Wiley & Sons, Inc.; Current Protocols in Human Genetics,
Edited by Nicholas C. Dracopoli et al., John Wiley & Sons, Inc.; Current Protocols in
Protein Science, Edited by John E. Coligan et al., John Wiley & Sons, Inc.; Current
Protocols in Immunology, Edited by John E. Coligan et al., John Wiley & Sons, Inc.




WO 99/64608 



PCT/EP99/03895 



EXAMPLES 

Isolation of human corin cDNA clones: 

A partial EST sequence was identified based on analysis of the Incyte EST
database for novel serine protease cDNAs. The clone (307474) was ordered from
Incyte. A 2.1 kb EcoRI-XhoI fragment from the clone was used to screen a human
heart cDNA library (Clontech). Phagemid 14b2, which contains a 3.8 kb insert, was
obtained from one positive phage clone by in vivo excision. Oligo primers which were
derived from phagemid 14b2 were used to further clone the 5'-end cDNA sequence by
5' RACE, using Marathon-ready human heart cDNA (Clontech) as templates. The
PCR products were cloned into pCRII vector (Invitrogen) and sequenced. The
full-length (i.e., having an initiation codon and a termination codon) human corin cDNA
sequence was obtained by compiling sequences obtained from 5'-RACE and
phagemid using GCG DNA sequence analysis package.

Oligo primers used in 5' RACE

PrWY109: 5'-CAGTTGGTTTGAACAAGTGCAGGG-3'
PrWY110: 5'-TGCAAGGAGGGATACGCTCGCCTG-3'
PRWY111: 5'-AATCCCAAGAACAGACTCACAGCG-3'
PRWY118: 5'-CGGGTCACAGAGAGAGCTACCACC-3'
PRWY119: 5'-GGTCTCCTTCTTGACATGAATCTG-3'

5'-AACAAAACGATCCTTGGAGGTCGGACGAGT-3'
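
As an optional sanity check on primers such as those listed above, the length and G+C
content of an oligonucleotide can be computed directly from its sequence. The following
Python sketch is illustrative only; the function name is hypothetical, and the
specification does not state that any such check was performed. The sequence used is
that of primer PrWY109.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a DNA sequence written with A, C, G, and T."""
    seq = sequence.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("sequence must be non-empty and contain only A, C, G, and T")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Primer PrWY109 as listed above (5' to 3').
PRWY109 = "CAGTTGGTTTGAACAAGTGCAGGG"

print(f"length = {len(PRWY109)} nt, GC = {gc_fraction(PRWY109):.0%}")
```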

Northern Analysis: 

The 2.1 kb EcoRI-XhoI fragment of Incyte clone 307474 was labeled with 32P-dCTP
using a random primer labeling kit (Boehringer). Human Multiple Tissue
Northern Blot I, Human Multiple Tissue Northern Blot II, and Human Muscle Northern
Blot filters (Clontech) were hybridized with the labeled human corin cDNA probe.
The Northern hybridization was carried out overnight at 42°C with 40% formamide,
5X Denhardt's solution, 6X SSC, 100 µg/ml salmon sperm DNA, 0.1% SDS. The
filters were washed with 0.2X SSC, 0.1% SDS at 60°C and exposed to Fuji imaging
plates.




Cloning of Mouse Corin cDNA: 

Mouse corin cDNA clones were isolated by a PCR-based strategy. A mouse
heart cDNA library was purchased from Clontech and used as templates for PCR
amplification (30 cycles of 1.5-min annealing at 55°C, 1.5-min extension at 72°C,
and 1-min denaturation at 94°C). The sequences of PCR primers are based on the
human corin cDNA sequences. Primers used for amplification of the mouse cDNA
include:

Cor09: 5'-TCTTCTGTGTACTAAACAAGACTG-3'
Cor12: 5'-AGGCCCCAGGACTTTGGAAAAGCA-3'
Cor02: 5'-ACAGTGGCCTGAAGACACAGATTG-3'
Prwy128: 5'-ACAGAGCATCGCTGCGGGGACGGG-3'

DNA fragments from the PCR reactions were cloned into the pCRII vector. Plasmid
clones that were derived from independent PCR reactions were used for further
sequencing. Mouse and human corin cDNAs share over 85% sequence identity.



In situ Hybridization: 

RT-PCR: mRNA samples were isolated from Hec-1-A, U2-OS, SK-LMS-1,
and AN3-CA cells using a commercial RNA preparation kit (Oligotex Direct mRNA
Mini Kits, QIAGEN). First strand cDNAs were synthesized using Superscript II
RNase H- reverse transcriptase (Life Technologies). Human corin specific
oligonucleotide primers (sense primer:
5'-AACAAAAGGATCCTTGGAGGTCGGACGAGT-3' and antisense primer:
5'-CGGAGCCCCATGAAGTTAATCCA-3') were used to amplify a 630-bp
fragment of corin cDNA between nucleotides 2475 and 3105. Oligonucleotide
primers TFR1 (5'-GTCAATGTCCCAAACGTCACCAGA-3') and TFR2
(5'-ATTTCGGGAATGCTGAGAAAACAGACAGA-3'), derived from the human
glyceraldehyde-3-phosphate dehydrogenase (GAPDH) gene, were used as an internal
quantification control. PCR reactions were performed with a thermal cycler
(Perkin-Elmer, model 480). PCR products were separated on 1% agarose gels and
visualized by ethidium bromide staining.
In Situ Hybridization: Mouse adult heart and embryonic tissue sections were
deparaffinized in xylene, rehydrated and fixed in 4% paraformaldehyde. The tissues
were digested with proteinase K (20 mg/ml), then treated with triethanolamine/acetic
anhydride and dehydrated. An 800-bp mouse corin cDNA fragment from the coding
region was cloned into pCRII (Invitrogen) in two orientations to yield plasmids pM11
and pM41. The plasmids were linearized by HindIII digestion. Sense and antisense
probes were synthesized using T7 RNA polymerase (T7/SP6 transcription kit,
Boehringer Mannheim) and labeled with [33P]UTP (Amersham). The hybridization
was carried out as described (14). The slides were dehydrated and dipped in Kodak
NTB-2 emulsion and exposed for 4 weeks in light-tight boxes at 4°C. Photographic
development was carried out in a Kodak D-19 developer. The slides were stained
with hematoxylin and eosin and analyzed using both light- and dark-field optics of a
Zeiss microscope.

Fluorescent In Situ Hybridization (FISH) Analysis: P1 phage clones 
containing the human corin gene were isolated by filter hybridization using a human 
corin cDNA as the probe. One clone was confirmed by DNA sequencing using a 
primer from human corin cDNA. The DNA fragment from this P1 phage was labeled 
with digoxigenin-dUTP. The labeled probe was combined with sheared human DNA 
and hybridized to metaphase chromosomes derived from PHA-stimulated peripheral 
blood lymphocytes in a solution containing 50% formamide, 10% dextran sulfate and 
2 x SSC. Hybridization signals were detected with fluorescent-labeled 
anti-digoxigenin antibodies, and chromosomes were counterstained with DAPI 
(4',6-diamidino-2-phenylindole). A total of 80 metaphase cells were analyzed, of 
which 74 exhibited specific labeling. 

Homology Model of the Protease Domain of Corin: A model of the corin 
protease domain (amino acids 802-1042) was built based on the structure of bovine 
chymotrypsinogen A at 1.8 Å resolution (15,16), using the Homology program 
(Insight II, 1995, MSI, San Diego, CA). Rotamers were used for non-identical 
side-chain replacements (16). Coordinates for the loop insertions were extracted 
from the Brookhaven protein data bank (17). The model was refined by energy 
minimization using the AMBER force field (Discover 95.0), with a 
distance-dependent dielectric constant. The minimization used the steepest descents 
and conjugate gradient methods: first for the loops only where insertions and 
deletions occurred, then side-chains, and a final round of minimization keeping the 
Cα atoms fixed. The residues of corin (His843, Asp892 and Ser985) corresponding 
to the catalytic triad of the template structure were also held fixed. 
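
The staged refinement described above (free a subset of coordinates, minimize, then widen the subset while some atoms stay fixed) can be illustrated with a toy problem. The sketch below is not the AMBER force field or the Discover program: it assumes a one-dimensional chain of harmonic springs, uses plain steepest descent only (no conjugate gradient), and all names, index sets and coordinates are hypothetical.

```python
def energy(x, rest=1.0):
    """Toy potential: harmonic springs between neighbouring points."""
    return sum(0.5 * (x[i + 1] - x[i] - rest) ** 2 for i in range(len(x) - 1))


def gradient(x, rest=1.0):
    """Analytic gradient of the toy potential."""
    g = [0.0] * len(x)
    for i in range(len(x) - 1):
        d = x[i + 1] - x[i] - rest
        g[i] -= d
        g[i + 1] += d
    return g


def steepest_descent(x, free, steps=500, step_size=0.1):
    """Move only the indices listed in `free`; all others stay fixed."""
    x = list(x)
    for _ in range(steps):
        g = gradient(x)
        for i in free:
            x[i] -= step_size * g[i]
    return x


# Hypothetical 1-D "model": indices 0 and 5 play the role of fixed atoms.
start = [0.0, 1.4, 1.6, 3.0, 4.5, 5.0]
loop_idx = [2, 3]        # stage 1: "loop" coordinates only
side_idx = [1, 4]        # stage 2: "side-chain" coordinates only
free_idx = [1, 2, 3, 4]  # stage 3: everything except the fixed points

stage1 = steepest_descent(start, loop_idx)
stage2 = steepest_descent(stage1, side_idx)
stage3 = steepest_descent(stage2, free_idx)

for name, coords in [("start", start), ("loops", stage1),
                     ("side-chains", stage2), ("final", stage3)]:
    print(f"{name:12s} energy = {energy(coords):.6f}")
```

Restricting the update to a list of free indices is what keeps the fixed coordinates untouched in every stage, which mirrors holding the Cα atoms and catalytic triad residues fixed.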



Biological activity of corin 

Several different cell lines were produced by conventional methods to assay 
for the biological activity of corin. 293 cells were cotransfected with pro-ANF and 
corin expressing plasmids. Two additional cell lines were created by cotransfecting 
293 cells with a pro-ANF expressing plasmid and a hepsin or prothrombin expressing 
plasmid. Only the cells expressing corin converted pro-ANF to ANF, as demonstrated 
by Western blot using antibodies to ANF. 

Another experiment was conducted in which conditioned media containing 
pro-ANF was contacted with cells expressing recombinant corin polypeptide. 
Again, pro-ANF was converted to ANF. Control cells expressing hepsin or 
prothrombin had no effect on pro-ANF. 

Without further elaboration, it is believed that one skilled in the art can, using 
the preceding description, utilize the present invention to its fullest extent. The 
preceding preferred specific embodiments are, therefore, to be construed as merely 
illustrative, and not limitative of the remainder of the disclosure in any way 
whatsoever. 

The entire disclosure of all applications, patents, publications, nucleic acids 
having sequence identification numbers, etc., cited above and in the figures is 
hereby incorporated in its entirety by reference, including the following nucleic acid 
fragments from the Unigene, PubEST, and GenBank databases: Hs.62794 
(AA126468 [1686098], AA126648 [1686206], AA625395 [2537780], AA046682 
[1524579], AA249850 [1881137], and AA046793 [1524691]), and Hs.71798 
(AA147031 [1716421]); Hs.121626 (AA771958), Hs.1657 (M69297), and Hs.47712 
(AA203291), g1231787; g1312726; g1337948; and g942724. 

From the foregoing description, one skilled in the art can easily ascertain the 
essential characteristics of this invention, and without departing from the spirit and 
scope thereof, can make various changes and modifications of the invention to adapt 
it to various usages and conditions. 

CLAIMS 

What is claimed: 

1. An isolated full-length mammalian corin polypeptide, or a biologically-active 
polypeptide fragment thereof, with the proviso that said polypeptide does not consist 
of Hs.62794 (AA126468 [1686098], AA126648 [1686206], AA625395 [2537780], 
AA046682 [1524579], AA249850 [1881137], or AA046793 [1524691]), or 
Hs.71798 (AA147031 [1716421]). 

2. An isolated full-length mammalian corin, or a biologically-active fragment 
thereof, of claim 1, wherein said polypeptide has a serine protease catalytic activity 
domain or an immunogenic activity specific for corin. 

3. An isolated full-length mammalian corin, or a biologically-active fragment 
thereof, of claim 1, which is human. 

4. An isolated full-length mammalian corin, or a biologically-active fragment 
thereof, of claim 1, which is mouse. 

5. An isolated full-length mammalian corin of claim 1 which is human, 
comprising amino acid 1 to amino acid 1042 as set forth in Seq. 2. 

6. An isolated mammalian corin of claim 1 which is mouse, comprising amino 
acid 1 to amino acid 1113 as set forth in Seq. 4. 

7. An isolated full-length mammalian corin of claim 1 which is human, coded 
for by the DNA sequence set forth in Seq. 1. 

8. An isolated full-length mammalian corin of claim 1 which is mouse, coded 
for by the DNA sequence set forth in Seq. 3. 

9. An isolated full-length mammalian corin of claim 1 which is human, coded 
for by a naturally obtainable nucleic acid sequence which hybridizes under high 
stringency conditions to the DNA sequence set forth in Seq. 1. 

10. An isolated mammalian corin of claim 1 which is mouse, coded for by a 
naturally obtainable nucleic acid sequence which hybridizes under high stringency 
conditions to the DNA sequence set forth in Seq. 3. 

11. An isolated nucleic acid comprising a nucleotide sequence coding for a 
mammalian full-length corin polypeptide, or a biologically-active polypeptide thereof, 
with the proviso that the nucleic acid does not consist of Hs.62794 (AA126468 
[1686098], AA126648 [1686206], AA625395 [2537780], AA046682 [1524579], 
AA249850 [1881137], or AA046793 [1524691]), or Hs.71798 (AA147031 
[1716421]), or any of said nucleic acids present in a cloning vector. 

12. An isolated nucleic acid of claim 11, wherein said coded for polypeptide has 
a serine protease catalytic activity or an immunogenic activity specific for corin. 

13. An isolated nucleic acid of claim 11 which is human. 

14. An isolated nucleic acid of claim 11, wherein the nucleotide sequence codes 
for amino acid 1 to amino acid 1042 as set forth in Seq. 2. 

15. An isolated nucleic acid of claim 11, wherein the nucleotide sequence codes 
for amino acid 1 to amino acid 1113 as set forth in Seq. 4. 

16. An isolated nucleic acid of claim 11 having the nucleotide sequence set forth 
in Seq. 1. 

17. An isolated nucleic acid of claim 11 having the nucleotide sequence set forth 
in Seq. 3. 

18. An isolated nucleic acid consisting essentially of any continuous sequence of 
12-100 base pairs, or a complement thereto, selected from the nucleotide sequence 
set forth in Seq. 1. 

19. An isolated nucleic acid consisting essentially of any continuous sequence of 
12-100 base pairs, or a complement thereto, selected from the nucleotide sequence 
set forth in Seq. 3. 

20. An isolated nucleic acid sequence of claim 18, further comprising a detectable 
label. 

21. An isolated nucleic acid of claim 18, having at least one, but not more than 
five, nucleotide substitutions in said sequence, and which hybridizes under high 
stringency conditions to the nucleotide sequence set forth in Seq. 1, or a complement 
thereto. 

22. An isolated nucleic acid of claim 11, having a naturally-obtainable nucleotide 
sequence which hybridizes under high stringency conditions to the 
nucleotide sequence set forth in Seq. 1, with the proviso that said nucleic acid does 
not consist of Hs.62794 (AA126468 [1686098], AA126648 [1686206], AA625395 
[2537780], AA046682 [1524579], AA249850 [1881137], or AA046793 [1524691]), 
or Hs.71798 (AA147031 [1716421]). 

23. An isolated nucleic acid of claim 11, having a naturally-obtainable nucleotide 
sequence which hybridizes under high stringency conditions to the DNA 
sequence set forth in Seq. 3, with the proviso that said nucleic acid does not consist 
of Hs.62794 (AA126468 [1686098], AA126648 [1686206], AA625395 [2537780], 
AA046682 [1524579], AA249850 [1881137], or AA046793 [1524691]), or 
Hs.71798 (AA147031 [1716421]). 

24. An isolated nucleic acid of claim 11, wherein the nucleotide sequence is 
operably linked to an expression control sequence. 



WO 99/64608    PCT/EP99/03895 

25. An isolated nucleic acid of claim 11, wherein the nucleic acid codes for said 
polypeptide without interruption. 

26. An isolated nucleic acid of claim 11, wherein the nucleic acid further 
comprises a detectable label. 

27. An isolated nucleic acid of claim 22, comprising at least 95% nucleotide 
sequence identity to the nucleotide sequence set forth in Seq. 1, with the proviso that 
the nucleic acid does not consist of Hs.62794 (AA126468 [1686098], AA126648 
[1686206], AA625395 [2537780], AA046682 [1524579], AA249850 [1881137], or 
AA046793 [1524691]), or Hs.71798 (AA147031 [1716421]). 

28. A method of expressing in transformed host cells, a human corin coded for 
by a nucleic acid, comprising: 

culturing transformed host cells containing a nucleic acid according to claim 
9 under conditions effective to express the polypeptide. 

29. A method of claim 28, wherein said host cells are mammalian. 

30. A method of claim 28, further comprising isolating the membrane fraction of 
said host cells comprising said polypeptide. 

31. An isolated polypeptide produced by a method of claim 28. 

32. A transformed host cell containing a nucleic acid of claim 7. 

33. A vector comprising a nucleic acid of claim 7. 

34. A method of identifying modulators of the serine protease catalytic activity 
of a mammalian corin polypeptide, comprising: 

reacting, in the presence of a test compound, a mammalian corin polypeptide, 
and substrate for serine protease, under conditions effective for said polypeptide to 
cleave said substrate; 

detecting said cleavage; and 

identifying whether the test compound modulates said serine protease activity 
by comparing the amount of cleavage in the presence and absence of the test 
compound. 

35. A method of claim 34, wherein said substrate is chromogenic, and cleavage 
of said substrate results in the appearance of a detectable color. 

36. A method of claim 34, wherein said mammalian corin polypeptide is 
recombinant. 

37. A method of claim 34, wherein said mammalian corin polypeptide has the 
sequence of amino acid 1 to amino acid 1042 as set forth in Seq. 2. 

38. A method of claim 34, wherein said mammalian corin polypeptide has the 
sequence of the nucleic acid sequence set forth in Seq. 1. 

39. A method of claim 34, wherein said mammalian corin polypeptide has the 
sequence of amino acid 1 to amino acid 1113 as set forth in Seq. 4. 

40. A method of claim 34, wherein said mammalian corin polypeptide has the 
sequence of the nucleic acid sequence set forth in Seq. 3. 

41. A method of claim 34, wherein the substrate is pro-ANF. 

42. An isolated antibody which is specific for a human or mouse corin. 

43. An isolated antibody of claim 42, which binds to an amino acid sequence 
selected from Seq. 2. 

44. An isolated antibody of claim 42, which binds to an amino acid sequence 
selected from Seq. 4. 

SEQUENCE LISTING 

<110> Morser, John 
      Wu, Qingyu 
      Yan, Wei 

<120> NOVEL SERINE PROTEASE NUCLEIC ACIDS AND POLYPEPTIDES 

<130> SCHERING AG 

<140> 
<141> 

<150> 09/092,029 
<151> 1998-06-05 

<160> 18 

<170> PatentIn Ver. 2.0 

<210> 1 
<211> 4933 
<212> DNA 
<213> Homo sapiens corin 

<400> 1 

aaatcatccg cagtgcctcc ccgggggaca cgtagaggag agaaaagcga ccaagataaa 60 
agtggacaga agaataagcg agacttttta tccatgaaac agtctcctgc cctcgctccg 120 
gaagagcgct accgcagagc cgggtcccca aagccggtct tgagagctga tgacaataac 180 
atgggcaatg gcrgctctca gaagctggcg actgctaacc tcctccggtt cctattgctg 240 
gtcctgattc catgtatctg tgctctcgtt ctcttgctgg tgatcctgct ttcctatgtt 300 
ggaacactac aaaaggtcta ttttaaatca aatgggagtg aacctttggt cactgatggt 360 
gaaatccaag ggtccgatgt tattctraca aatacaattt ataaccagag cactgtggtg 420 
tctactgcac atcccgacca acacgttcca gcctggacta cggatgcttc tcrcccaggg 480 
gaccaaagtc acaggaatac aagtgcctgt atgaacatca cccacagcca gtgtcagatg 540 
ctgccctacc acgccacgct gacacctctc ctctcagttg tcagaaacat ggaaatggaa 600 
aagttccrca agtttttcac atatctccat cgcctcagtt gctatcaaca taccatgctg 660 
tttggctgta cccucgcctt ccctgagtgc atcattgatg gcgatgacag ecatggacrc 720 
ctgcccrgta ggtccttctg tgaggcrgca aaagaaggct gtgaatcagt cctggggacg 780 
gtgaattact cctggccgga tttcctcaga tgctcccagt ttagaaacca aactgaaagc 840 
agcaatgtca gcagaatttg cttctcacct cagcaggaaa acggaaagca attgctctgt 900 
ggaaggggtg agaactttct gtgtgccagt ggaatctgca tccccgggaa actgcaatgt 960 
aatggctaca acgactgtga cgactggagt gacgaggctc attgcaactg cagcgagaat 1020 
ctgtttcact gtcacacagg caagtgcctt aattacagcc ttgtgtgtga tggatatgat 1080 
gactgtgggg atttgagtga tgagcaaaac tgtgattgca atcccacaac agagcatcgc 1140 
tgcggggacg ggcgctgcat cgccatggag tgggtgtgtg atggtgacca cgactgtgtg 1200 
gataagrccg acgaggtcaa ctgctcctgt cacagccagg gtctggtgga atgcagaaat 1260 
ggacaatgta tccccagcac gtttcaatgt gatggtgacg aggactgcaa ggargggagt 1320 
gatgaggaga actgcagcgt cattcagact tcatgtcaag aaggagacca aagatgcctc 1380 
tacaatccct gccttgattc atgtggrggr agctctctct grgacccgaa caacagtcrg 1440 
aataactgta grcaatgtga accaattaca ttggaactct gcatgaattt gccctacaac 1500 
agtacaagtt atccaaatta ttttggccac aggactcaaa aggaagcatc catcagctgg 1560 
gagtcttctc ttttccctgc acttgttcaa accaactgtt ataaatacct catgttcttt 1620 
tcttgcacca ttttggtacc aaaatgtgat gtgaatacag gcgagcgtat ccctccttgc 1680 
agggcattgt gtgaacactc taaagaacgc tzgtgagtcrg rtcttgggac tgtgggccta 1740 
cagtggcctg aagacacaga ttgcagtcaa tttccagagg aaaattcaga caatcaaac: 1800 
tgcctgatgc crgaugaata tgtggaagaa tgctcaccta gtcattrcaa gtgccgctca 1860 
ggacagrgtg tzczgqczzz cagaagatgt gatggccagg ccgacrgnga cgatgacagr 1920 
gatgaggaaa acrgtggtrg taaagagaga gatctttggg aacgtccatc caataaacaa 1980 
tgtttgaagc acacagtgat ctgcgarggg ttcccagact gccctgatta catggacgag 2040 
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aaaaactgci 
cgtgacctgt 
gtgaccctct 
gaacaccatg 
atgggtttag 
tggctgacat 
gtaaatgggc 
tgtgggcgcc 
cctggaaggt 
tgtgtcctca 
aatgctgcag 
atgcagacac 
gactatgaca 
cctgtctgct 
ggctggggcc 
atttctctgg 
tgtgctggct 
gtttgtgaga 
cgcrtttcca 
attaaaagac 
acittigcca 
gagctgtaca 
tgtttgtttt 
aagttcaatg 
aaccatgact 
aacaggttat 
gattaattat 
gttaacattg 
acacttcgaa 
tgccatgttg 
tttacatgtt 
gataaaagac 
caaagttctt 
atatcaagat 
ttcaggatat 
tattttgttt 
atgagagatg 
tttttcagtt 
acagtaagac 
ctggtttctt 
acgcaactca 
taaaacacaa 
aacaactagg 
agatactacc 
gaaatt caaa 
aacctaagga 
agaacatatg 
tgctcaactt 
aaataaaatg 



catt:tgcca 
ggtgtgatgg 
ctacaaatgt 
tgtgtgcaga 
gagaaccatc 
uacactccaa 
agtcttgtga 
gccct'gctgc 
ggccatggca 
ttgccaagaa 
tttggaaagt 
gctttgtgaa 
tcagcatcgr 
tgcccaaccc 
acatgggcaa 
aacattgtca 
atgagtctgg 
agcctggagg 
aagt cctggg 
agatttacat 
gciiacactaa 
gaagcacttt 
ggactaartt 
aaagacttta 
gcagcacaaa 
ggaatggtct 
ctctaccagt 
gagacttgct 
ttttagggta 
gtgtgctgaa 
taaagtattt 
agcaataaaa 
tagatatcaa 
cataatttta 
atgagttggc 
tacttatgtg 
caattttnta 
ttcttgcgtt 
tttgcatgga 
ctcattggta 
atttctgaag 
atcttcatat 
attccctact 
tgcaatttta 
aagrtgtagc 
gagtgtgaaa 
tcagttttgt 
aggctcagtg 



agatgatgag 
tgaagccgac 
gaactcctct 
tggctggcag 
tgtgaccaaa 
ctgggagagc 
gagcagaagt 
ccgaatgaac 
gtgttctctg 
gtgggttctg 
ggtgcttggc 
gaccatcatc 
tgagctgagt 
ggagcagtgg 
taaaatgcca 
gtcctacttt 
cacagttgat 
acggtggaca 
gcctggcgtt 
ccagaccrr: 
aagaaaatgg 
tcatggacag 
ttutcaaiiLt 
caaaagcaaa 
attatcgact 
cttttatcct 
ttttgtttct 
tttctttttc 
cacgagcaca 
cccagtaact 
actattaccc 
gaccagtgtc 
tattaacact 
tagaagagtc 
tgattgagac 
gcaaagaact 
aaaagaaaat 
catccacacc 
gtaaaacatt 
attaaaattt 
caattagtgg 
atacttaatt 
aacctccacc 
tacatgtatt 
aatttctata 
tccagcaact 
tacagttgta 
aaatatatat 
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ctggaatgtg 
cgcicagaca 
tcctttctga 
gagatattga 
ttgatacagg 
ctcaatggga 
aaaatttctc 
aaaaggatcc 
cagagtgaac 
acagttgccc 
atcaacaatc 
ctgcatcccc 
gaagacatca 
ctagagcctg 
tttaagctgc 
gacatgaaga 
tcatgcatgg 
ttatttggat 
tatagtaatg 
ctccraaact 
cctrcttgac 
aaatgctcaa 
attttttcac 
caaagcagac 
ctggcgagat 
atcacaaaaa 
caagctcagt 
tttttttata 
acgtgcaggt 
cgtcatttga 
ccttctaatg 
atccatttag 
tgacattatt 
tctatagaac 
tgcaacaact 
ggatattaaa 
taatttgcat 
aacaaagtca 
ttgtaatttt 
tagaaatgat 
taaaaagtat 
taattagtca 
agcaacctgg 
tttgtatctt 
ctattcatct 
gaattgtggt 
gctacatact 
attatactta 



caaaccatgc 
gtucagaiga 
tggttcacag 
gtcagctggc 
aacaggagaa 
ccaccrtaca 
ttctgtgtac 
rtggaggtcg 
ccagtggaca 
actgc ttcga 
tagaccatcc 
gctacagtcg 
gtgagactgg 
acacgtactg 
aagagggaga 
ccatcaccac 
gtgacagcgg 
taactt catg 
t o t c a t a utt 
aattataagg 
tgtgaagagc 
tcgtgcactg 
cttcattttt 
t it greet tt 
itaaaatcag 
aagacataga 
gcatagtggt 
ccccacaatt 
tagttacata 
tttattaaaa 
tttgcataat 
gtagcaagac 
ggacccccca 
tgtcctcata 
acatctatat 
czttgcaaaa 
ccctcgttta 
taaagagcat 
cctcaaaaga 
ttttagctct 
ttttccccac 
ggcatccatt 
actgcctcag 
ttctgtgtgt 
cctgtccttc 
cacgactgta 
caatgtatca 
ttttaaataa 



gtgtgtgt ca 
atgggactgt 
agctgccaca 
ctgcaagcag 
agagecgegg 
tgaacttcta 
taaacaagac 
gaegagtege 
tatctgtggc 
ggggagagag 
atcagtgttc 
agcagtggtg 
ctacgtccgg 
ctatatcaca 
ggtcegcatt 
teggatgata 
tgggectett 
gggctccgt c 
cgt cgaat gg 
atgatcagag 
tgectgeaga 
caaatttgea 
ctctcatttc 
tgccaggcct 
gtgctacagt 
tatttaggct 
aaatttcagt 
cttttttatt 
tgtatacatg 
gecaagacaa 
tctgagaact 
atattgaatg 
ttctggatgt 
gctgggtttg 
ttatgggcaa 
gagaatttag 
atiaaattra 
attttagagc 
tgtttaatat 
aggecacttt 
taaaaaactt 
ttgectttta 
cattccaaat 
aaacatagtt 
agtttgtata 
tgaaagttca 
acttttagee 
ttcttaatac 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
258C 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4 2 60 
4 320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4933 



<210> 2 
<211> 1042 
<212> PRT 
<213> Homo sapiens corin 

<400> 2 

Met Lys Gln Ser Pro Ala Leu Ala Pro Glu Glu Arg Tyr Arg Arg Ala 
1               5                   10                  15 

Gly Ser Pro Lys Pro Val Leu Arg Ala Asp Asp Asn Asn Met Gly Asn 
            20                  25                  30 

Gly Cys Ser Gln Lys Leu Ala Thr Ala Asn Leu Leu Arg Phe Leu Leu 
        35                  40                  45 

Leu Val Leu Ile Pro Cys Ile Cys Ala Leu Val Leu Leu Leu Val Ile 
    50                  55                  60 

Leu Leu Ser Tyr Val Gly Thr Leu Gln Lys Val Tyr Phe Lys Ser Asn 
65                  70                  75                  80 

Gly Ser Glu Pro Leu Val Thr Asp Gly Glu Ile Gln Gly Ser Asp Val 
                85                  90                  95 

Ile Leu Thr Asn Thr Ile Tyr Asn Gln Ser Thr Val Val Ser Thr Ala 
            100                 105                 110 

His Pro Asp Gln His Val Pro Ala Trp Thr Thr Asp Ala Ser Leu Pro 
        115                 120                 125 

Gly Asp Gln Ser His Arg Asn Thr Ser Ala Cys Met Asn Ile Thr His 
    130                 135                 140 

Ser Gln Cys Gln Met Leu Pro Tyr His Ala Thr Leu Thr Pro Leu Leu 
145                 150                 155                 160 

Ser Val Val Arg Asn Met Glu Met Glu Lys Phe Leu Lys Phe Phe Thr 
                165                 170                 175 

Tyr Leu His Arg Leu Ser Cys Tyr Gln His Ile Met Leu Phe Gly Cys 
            180                 185                 190 

Thr Leu Ala Phe Pro Glu Cys Ile Ile Asp Gly Asp Asp Ser His Gly 
        195                 200                 205 

Leu Leu Pro Cys Arg Ser Phe Cys Glu Ala Ala Lys Glu Gly Cys Glu 
    210                 215                 220 

Ser Val Leu Gly Met Val Asn Tyr Ser Trp Pro Asp Phe Leu Arg Cys 
225                 230                 235                 240 

Ser Gln Phe Arg Asn Gln Thr Glu Ser Ser Asn Val Ser Arg Ile Cys 
                245                 250                 255 

Phe Ser Pro Gln Gln Glu Asn Gly Lys Gln Leu Leu Cys Gly Arg Gly 
            260                 265                 270 

Glu Asn Phe Leu Cys Ala Ser Gly Ile Cys Ile Pro Gly Lys Leu Gln 
        275                 280                 285 

Cys Asn Gly Tyr Asn Asp Cys Asp Asp Trp Ser Asp Glu Ala His Cys 
    290                 295                 300 

Asn Cys Ser Glu Asn Leu Phe His Cys His Thr Gly Lys Cys Leu Asn 
305                 310                 315                 320 

Tyr Ser Leu Val Cys Asp Gly Tyr Asp Asp Cys Gly Asp Leu Ser Asp 
                325                 330                 335 

Glu Gln Asn Cys Asp Cys Asn Pro Thr Thr Glu His Arg Cys Gly Asp 
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340 

Gly Arg Cys He 
355 

Val Asp Lys Ser 
370 

Val Glu Cys Arg 
385 

Gly Asp Glu Asp 



He Gin Thr Ser 
420 

Cys Leu Asp Ser 
4 35 

Leu Asn Asn Cys 
450 

Asn Leu Pro Tyr 
465 

Thr Gin Lys Glu 



Leu Val Gin Thr 
500 

He Leu Val Pro 
515 

Cys Arg Ala Leu 
530 

Gly He Val Gly 
545 

Pro Glu Glu Asn 



Val Glu Glu Cys 
580 

Val Leu Ala Ser 
595 

Ser Asp Glu Glu 
610 

Pro Ser Asn Lys 
625 

Pro Asp Cys Pro 



Asp Asp Glu Leu 



Ala Met Glu Tro 
3 60 

Asp Glu Val Asn 
375 

Asn Gly Gin Cys 
390 

Cys Lys Asp Gly 
4 05 

Cys Gin Glu Gly 



Cys Gly Gly Ser 
4 4 C 

Ser Gin Cys Glu 
455 

Asn Ser Thr Ser 
470 

Ala Ser He Ser 
485 

Asn Cys Tyr Lys 



Lys Cys Asp Val 
520 

Cys Glu His Ser 
535 

Leu Gin Trp Pro 
550 

Ser Asp Asn Gin 
565 

Ser Pro Ser His 



Arg Arg Cys Asp 
600 

Asn Cys Gly Cys 
615 

Gin Cys Leu Lys 
630 

Asp Tyr Met Asp 
645 

Glu Cys Ala Asn 
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345 

Val Cys Asp Gly 



Cys Ser Cys His 
38 0 

He Pro Ser Thr 
395 

Ser Asp Glu Glu 
410 

Asp Gin Arg Cys 
425 

Ser Leu Cys Asp 



Pro He Thr Leu 
460 

Tyr Pro Asn Tyr 
475 

Trp Glu Ser Ser 
490 

Tyr Leu Met Phe 
505 

Asn Thr Gly Glu 



Lys Glu Arg Cys 
54 0 



Glu Asp Thr Asp 
555 

Thr Cys Leu Met 
570 

Phe Lys Cys Arg 
585 

Gly Gin Ala Asp 



Lys Glu Arg Asp 
620 

His Thr Val lie 
635 

Glu Lys Asn Cys 
650 

His Ala Cys Val 



350 

Aso His Asp Cys 
3 65 

Ser Gin Gly Leu 



Phe Gin Cys Asp 
4 00 

Asn Cys Ser Val 
415 

Leu Tyr Asn Pro 
430 

Pro Asn Asn Ser 

445 

Glu Leu Cys Met 



Phe Gly His Arg 
480 

Leu Phe Pro Ala 
495 

Phe Ser Cys Thr 
510 

Arg lie Pro Pro 
525 

Glu Ser Val Leu 



Cys Ser Gin Phe 
560 

Pro Asp Glu Tyr 
575 

Ser Gly Gin Cys 
590 

Cys Asp Asp Asp 
605 

Leu Trp Glu Cys 



Cys Asp Gly Phe 
64 0 

Ser Phe Cys Gin 



Ser Arg Asp Leu 
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660 

Trp Cys Asp Gly 
67 5 

Cys Val Thr Leu 
690 

His Arg Ala Ala 
705 

lie Leu Ser Gin 



Val Thr Lys Leu 
740 

Leu His Ser Asn 

7 5 5 

Leu Val Asn Gly 
770 

Cys Thr Lys Gin 
785 

Arg lie Leu Gly 



Cys Ser Leu Gin 
820 

lie Ala Lys Lys 
835 

Glu Asn Ala Ala 
850 

His Pro Ser Val 
865 

His Pro Arg Tyr 



Glu Leu Ser Glu 
900 

Leu Pro Asn Pro 
915 

Thr Gly Trp Gly 
930 

Gly Glu Val Arg 
945 

Met Lys Thr lie 



Thr Val Asp Ser 



Glu Ala Asp Cys 
680 

Ser lie Asn Val 
695 

Thr Glu His His 
710 

Leu Ala Cys Lys 
725 

lie Gin Glu Gin 



Trp Glu Ser Leu 
760 

Gin Ser Cys Glu 
775 

Asp Cys Gly Arg 
790 

Gly Arg Thr Ser 
805 

Ser Glu Pro Ser 



Trp Val Leu Thr 
840 

Val Trp Lys Val 
855 

Phe Met Gin Thr 
870 

Ser Arg Ala Val 
885 

Asp lie Ser Glu 



Glu Gin Trp Leu 
920 

His Met Gly Asn 
935 

lie lie Ser Leu 
950 

Thr Thr Arg Met 
965 

Cys Met Gly Asp 
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665 

Ser Asp Ser Ser 



Asn Ser Ser Ser 
700 

Val Cys Ala Asp 
715 

Gin Met Gly Leu 
730 

Glu Lys Glu Pro 
745 

Asn Gly Thr Thr 



Ser Arg Ser Lys 
780 

Arg Pro Ala Ala 
795 

Arg Pro Gly Arg 
810 

Gly His lie Cys 
825 

Val Ala His Cys 



Val Leu Gly He 
860 

Arg Phe Val Lys 
875 

Val Asp Tyr Asp 
890 

Thr Gly Tyr Val 
905 

Glu Pro Asp Thr 



Lys Met Pro Phe 
940 

Glu His Cys Gin 
955 

He Cys Ala Gly 
970 

Ser Gly Gly Pro 



670 

Asp Glu Trp Asp 
685 

Phe Leu Met Val 



Gly Trp Gin Glu 
720 

Gly Glu Pro Ser 
735 

Arg Trp Leu Thr 
750 

Leu His Glu Leu 
765 

He Ser Leu Leu 



Arg Met Asn Lys 
800 

Trp Pro Trp Gin 
815 

Gly Cys Val Leu 
830 

Phe Glu Gly Arg 
845 

Asn Asn Leu Asp 



Thr He He Leu 
880 

He Ser He Val 
895 

Arg Pro Val Cys 
910 

Tyr Cys Tyr He 
925 

Lys Leu Gin Glu 



Ser Tyr Phe Asp 
960 

Tyr Glu Ser Gly 
975 

Leu Val Cys Glu 
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980 985 990 

Lys Pro Gly Gly Arg Trp Thr Leu Phe Gly Leu Thr Ser Trp Gly Ser 
        995                 1000                1005 

Val Cys Phe Ser Lys Val Leu Gly Pro Gly Val Tyr Ser Asn Val Ser 
    1010                1015                1020 

Tyr Phe Val Glu Trp Ile Lys Arg Gln Ile Tyr Ile Gln Thr Phe Leu 
1025                1030                1035                1040 

Leu Asn 


<210> 3 
<211> 3547 
<212> DNA 
<213> Murine corin 



<400> 3 

gtggcagacg 

ggtttccttc 

atgctacctc 

ctgcgcgggg 

ccgtggcttc 

agccccgcct 

gaagctggtg 

cgccctcatc 

tttcaaatca 

tattcctgta 

gtccactcca 

aagcacctgc 

ggcacctctc 

gtacctccat 

ccctgagtgc 

tgaggctgct 

ttccctcaga 

ctgcttctca 

cctgtgtacc 

tgatgactgg 

aggcaagtgc 

tgacgagcaa 

cattgcggct 

caactgctct 

caccttccag 

tgacagtcag 

atcctgtgct 

tgagcccatc 

ttaccttggc 

tgcccttgta 

tccaaagtgt 

ctccaaagag 

cgactgcaat 

agatgtggaa 

ctccaggaga 

ttgtaaagag 

aatctgcgat 

ccaagacaat 

cggatgggtc 



gtccctcact 
agcgttcggg 
tcctgcagag 
gttccggggg 
ctctccgggt 
gctccagacg 
actgctaact 
gtgctgctgg 
aatgacagtg 
aatacagttt 
gcctggacac 
atgaacatca 
ttgccaattg 
cgcctcagtt 
gttgttgatg 
aaagaaggat 
tgctctcagt 
ct gcagcagg 
agcgggctct 
agcgacgagg 
ctccattaca 
aactgtgatt 
gagtgggtgt 
tgtcacagcc 
tgtgatgggg 
acgccatgtc 
ggtagctctc 
actttggaac 
cacagaactc 
caaaccaact 
gatgtgaata 
cgctgtgagt 
caatttccag 
gaatgcr etc 
tgtgacggcc 
agagct ct tt 
gggtt tccag 
gagctggaat 
aactcctcaa 



cctgtggctt 
tcagctccgt 
tccctccaac 
agactgeagg 
ccaagttcca 
tcttgagagc 
tgetgegett 
ccatcctgct 
aacctttggt 
attatgagaa 
cgagagctcc 
ctcacagcca 
tcaaaaacat 
gctatcaaca 
gcgatgacag 
gcgaatctgt 
ttagggacca 
aacatggaaa 
gcgtccccaa 
cgcattgcaa 
gcctcttgtg 
gtaatctcac 
gcgatgggga 
agggcctggt 
acgaagactg 
cagaaggaga 
tgtgtgactc 
tetgeatgaa 
aaaaggaagc 
gttacaaata 
caggacaacg 
ctgttctggg 
aggaaagttc 
cgagtcactt 
aggctgactg 
gggaatgtcc 
a— gt ccaga 
gtgccaacca 
acaariciaa 



ggegteggag 
geggagagee 
caccgccctc 
tggagccgtc 
ggctcccggc 
agacaggagc 
cctcctgctg 
gtcctttgtg 
cactgatggg 
cacaggggcg 
ttctccagag 
gtgt caaatt 
ggacatggag 
tatccrgctc 
gcaaggtctt 
cctgggaatg 
cactgagact 
gcaat cactc 
gaagctgcag 
ctgcagcaag 
tgatgggtac 
aaaagagcat 
ccatgactgt 
ggaatgeaga 
taaggatggg 
acagggatgc 
agacagcagc 
tttgccctac 
gtccatcagc 
cctcatgttt 
catcccgcct 
aatcgttggc 
agacaatcaa 
caaatgccgc 
tgacgacgac 
atttaataag 
cagtatggat 
tgagtgtgtg 
tgaatggggc 



acgctggcag 
cgctgctctt 
cgtgcactga 
ggacccggcc 
agctggaagg 
gtgggcgagg 
gtgctcatcc 
ggaacattaa 
gaagctcgag 
ccctctctgc 
gaccagagtc 
ctgccctacc 
aagttcctca 
ttcggctgta 
ctaccctgta 
gtgaactcct 
aacagcagtg 
tgtggagggg 
tgtaaegget 
gatctgtttc 
gatgactgtg 
cgctgtggag 
gtggacaagt 
agtggacagt 
agtgacgagg 
cttggcagtt 
ctgagtaact 
aaccatacac 
tgggagtcat 
ttegcttgea 
tgcagactcc 
ctgcagtggc 
acttgcctcc 
t egggacgat 
agtgacgagg 
caatgtctga 
gaaaaaaact 
ccgcgtgacc 
ugtgtgaccc 



tcatgggcag 
gtcctgggcg 
aeggtcttgg 
ccttggggac 
attgetttgg 
gctgtcctca 
cctgcatctg 
aaagggttta 
tgcctggtgt 
cccccagcca 
acaggaacac 
acagcaegtt 
agttcttcac 
gcctcgcctt 
gatctttcxg 
cctggccgga 
t cagaaagag 
gegagagett 
ataatgactg 
actgtggcac 
gggacctgag 
atgggcgctg 
ctgatgaggt 
gcatccctag 
agaactgeag 
cctgcgtcga 
gcagtcaatg 
attatccaaa 
cccttttccc 
ccattttggt 
tgtgtgagca 
ctgaagacac 
tgcccaatga 
gcgttctggg 
agaactgtgg 
agcatacatt 
gctcattttg 
tttggtgcga 
tctctaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 
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tgggaactct 
rgacggctgg 
gtctgtgacc 
ctgggagaat 
aagcagaagt 
ccgaatgaac 
gtgctctctg 
gtgggtcctg 
ggtatttggc 
gaccatcctg 
ggagctgagc 
ggaggagtat 
taaaatgccc 
gtcctatttt 
caccgtggac 
acagtggaca 
acctggagtg 
ccagaccttt 
tggagaatga 
cacggacagg 
actcagg 



tccrcattgc 
cgggagacgr 
aagctgatcc 
ctcaatggga 
gagatttccc 
aagaggatcc 
cagagtgaac 
acagttgccc 
ataaacaacc 
ctacatcccc 
gatgatatca 
ctagaaccag 
tttaagctgc 
gacatgaaga 
tcctgcatgg 
ttatttggtt 
tacagcaatg 
ctccaaaaga 
ccctctgaaa 
acgctcaaca 



tgactgttca 
tgagrcagct 
caggacagga 
gcaccttgca 
ttctgtgctc 
ttgggggtcg 
ccagtggaca 
attgctttga 
tggaccatcc 
gttacagtcg 
atgagacaag 
atacgtactg 
aggagggaga 
ccatcaccaa 
gagacagcgg 
taacttcatg 
tgtcttactt 
aatcccaagg 
cagaagcttg 
tgcaccgcaa 



caaatctgca 
ggccrgcaag 
aggccagcag 
ggagctgctg 
caagcaagac 
gactagtcgt 
tatctgtggc 
agggagagaa 
atcaggcttc 
agcagtggta 
ctacgtcaga 
ctacatcaca 
ggtccgcatt 
tcggatgatc 
tgggcctctg 
gggctccgtc 
tgtgggctgg 
ataatcagag 
tcctgccaag 
gatctctcct 



aaggaacacc 
cagatgggtr 
tggctgaggt 
gtatacaggc 
tgtggccgcc 
cctgggaggt 
tgtgtccrca 
gacgctgatg 
atgcagaccc 
gactatgata 
cctgtctgcc 
ggctggggcc 
atccctctgg 
tgtgctggct 
gtttgtgaac 
tgcttttcca 
attgaaagac 
actttgtggg 
agctgtacga 
gtttgtgcra 



acgtgtgrgc 
taggagaacc 
tgtaccccaa 
actcctgccc 
gccctgctgc 
ggccgtggca 
ttgccaagaa 
tttggaaagt 
gctttgtgaa 
tcagcgtggr 
tacccagtcc 
acatgggcaa 
agcagtgcca 
atgagtctgg 
gacccggagg 
aagttctggg 
aaatatatat 
gaaacctaca 
acaggcgrtt 
gatgagrt zz 



2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3547 



<210> 4 

<211> 1113 

<212> PRT 

<213> Murine corin 



<400> 4 
Met Gly Arg 
1 



Val Ser Phe Ser 
5 



Val Arg Val 
10 



Ser Ser Val Arg 



Arg Ala 
15 



Arg Cys Ser 



Cys Pro Gly Arg 
20 



Cys Tyr Leu 
25 



Ser Cys Arg Val 
30 



Pro Pro 



Thr Thr Ala 

35 



Leu Arg Ala Leu 



Asn Gly Leu Gly Cys Ala Gly Val Pro 
40 45 



Gly Glu Thr 
50 



Ala Gly Gly Ala 
55 



Val Gly Pro Gly Pro Leu Gly Thr Arg 
60 



Gly Phe Leu 
65 



Ser Gly Ser Lys 
70 



Phe Gin Ala Pro Gly Ser Trp Lys Asp 
75 30 



Cys Phe Gly 
Val Gly Glu 



Ala Pro Pro Ala 
85 

Gly Cys Pro Gin 
100 



Pro Asp Val 
90 

Lys Leu Val 
105 



Leu Arg Ala Asp 



Arg Ser 
95 



Thr Ala Asn Leu Leu Arg 
110 



Phe Leu Leu 
115 



Leu Val Leu lie 



Pro Cys lie 
120 



Cvs Ala Leu lie 
125 



Val Leu 



Leu Ala lie 
130 

Lys Ser Asn 
145 

Pro Gly Val 



Leu Leu Ser Phe 
135 

Asp Ser Glu Pro 
150 

lie Pro Val Asn 
165 



Val Gly Thr 
Leu Val Thr 



Thr Val Tyr 
170 



Leu Lys Arg Val Tyr Phe 
140 



Asp Gly Glu Ala 
155 

Tyr Giu Asn Thr 



Arg Val 
160 

Gly Ala 
175 
Pro Ser Leu Pro Pro Ser Gln Ser Thr Pro Ala Trp Thr Pro Arg Ala 
            180                 185                 190 

Pro Ser Pro Glu Asp Gln Ser His Arg Asn Thr Ser Thr Cys Met Asn 
        195                 200                 205 

Ile Thr His Ser Gln Cys Gln Ile Leu Pro Tyr His Ser Thr Leu Ala 
    210                 215                 220 

Pro Leu Leu Pro Ile Val Lys Asn Met Asp Met Glu Lys Phe Leu Lys 
225                 230                 235                 240 

Phe Phe Thr Tyr Leu His Arg Leu Ser Cys Tyr Gln His Ile Leu Leu 
                245                 250                 255 

Phe Gly Cys Ser Leu Ala Phe Pro Glu Cys Val Val Asp Gly Asp Asp 
            260                 265                 270 

Arg Gln Gly Leu Leu Pro Cys Arg Ser Phe Cys Glu Ala Ala Lys Glu 
        275                 280                 285 

Gly Cys Glu Ser Val Leu Gly Met Val Asn Ser Ser Trp Pro Asp Ser 
    290                 295                 300 

Leu Arg Cys Ser Gln Phe Arg Asp His Thr Glu Thr Asn Ser Ser Val 
305                 310                 315                 320 

Arg Lys Ser Cys Phe Ser Leu Gln Gln Glu His Gly Lys Gln Ser Leu 
                325                 330                 335 

Cys Gly Gly Gly Glu Ser Phe Leu Cys Thr Ser Gly Leu Cys Val Pro 
            340                 345                 350 

Lys Lys Leu Gln Cys Asn Gly Tyr Asn Asp Cys Asp Asp Trp Ser Asp 
        355                 360                 365 

Glu Ala His Cys Asn Cys Ser Lys Asp Leu Phe His Cys Gly Thr Gly 
    370                 375                 380 

Lys Cys Leu His Tyr Ser Leu Leu Cys Asp Gly Tyr Asp Asp Cys Gly 
385                 390                 395                 400 

Asp Leu Ser Asp Glu Gln Asn Cys Asp Cys Asn Leu Thr Lys Glu His 
                405                 410                 415 

Arg Cys Gly Asp Gly Arg Cys Ile Ala Ala Glu Trp Val Cys Asp Gly 
            420                 425                 430 

Asp His Asp Cys Val Asp Lys Ser Asp Glu Val Asn Cys Ser Cys His 
        435                 440                 445 

Ser Gln Gly Leu Val Glu Cys Arg Ser Gly Gln Cys Ile Pro Ser Thr 
    450                 455                 460 

Phe Gln Cys Asp Gly Asp Glu Asp Cys Lys Asp Gly Ser Asp Glu Glu 
465                 470                 475                 480 

Asn Cys Ser Asp Ser Gln Thr Pro Cys Pro Glu Gly Glu Gln Gly Cys 
                485                 490                 495 
Leu Gly Ser Ser Cys Val Glu Ser Cys Ala Gly Ser Ser Leu Cys Asp 
500 505 510 

Ser Asp Ser Ser Leu Ser Asn Cys Ser Gln Cys Glu Pro Ile Thr Leu 
515 520 525 

Glu Leu Cys Met Asn Leu Pro Tyr Asn His Thr His Tyr Pro Asn Tyr 
530 535 540 

Leu Gly His Arg Thr Gln Lys Glu Ala Ser Ile Ser Trp Glu Ser Ser 
545 550 555 560 

Leu Phe Pro Ala Leu Val Gln Thr Asn Cys Tyr Lys Tyr Leu Met Phe 
565 570 575 

Phe Ala Cys Thr Ile Leu Val Pro Lys Cys Asp Val Asn Thr Gly Gln 
580 585 590 

Arg Ile Pro Pro Cys Arg Leu Leu Cys Glu His Ser Lys Glu Arg Cys 
595 600 605 

Glu Ser Val Leu Gly Ile Val Gly Leu Gln Trp Pro Glu Asp Thr Asp 
610 615 620 

Cys Asn Gln Phe Pro Glu Glu Ser Ser Asp Asn Gln Thr Cys Leu Leu 
625 630 635 640 

Pro Asn Glu Asp Val Glu Glu Cys Ser Pro Ser His Phe Lys Cys Arg 
645 650 655 

Ser Gly Arg Cys Val Leu Gly Ser Arg Arg Cys Asp Gly Gln Ala Asp 
660 665 670 

Cys Asp Asp Asp Ser Asp Glu Glu Asn Cys Gly Cys Lys Glu Arg Ala 
675 680 685 

Leu Trp Glu Cys Pro Phe Asn Lys Gln Cys Leu Lys His Thr Leu Ile 
690 695 700 

Cys Asp Gly Phe Pro Asp Cys Pro Asp Ser Met Asp Glu Lys Asn Cys 
705 710 715 720 

Ser Phe Cys Gln Asp Asn Glu Leu Glu Cys Ala Asn His Glu Cys Val 
725 730 735 

Pro Arg Asp Leu Trp Cys Asp Gly Trp Val Asp Cys Ser Asp Ser Ser 
740 745 750 

Asp Glu Trp Gly Cys Val Thr Leu Ser Lys Asn Gly Asn Ser Ser Ser 
755 760 765 

Leu Leu Thr Val His Lys Ser Ala Lys Glu His His Val Cys Ala Asp 
770 775 780 

Gly Trp Arg Glu Thr Leu Ser Gln Leu Ala Cys Lys Gln Met Gly Leu 
785 790 795 800 

Gly Glu Pro Ser Val Thr Lys Leu Ile Pro Gly Gln Glu Gly Gln Gln 
805 810 815 

Trp Leu Arg Leu Tyr Pro Asn Trp Glu Asn Leu Asn Gly Ser Thr Leu 
820 825 830 

Gln Glu Leu Leu Val Tyr Arg His Ser Cys Pro Ser Arg Ser Glu Ile 
835 840 845 

Ser Leu Leu Cys Ser Lys Gln Asp Cys Gly Arg Arg Pro Ala Ala Arg 
850 855 860 

Met Asn Lys Arg Ile Leu Gly Gly Arg Thr Ser Arg Pro Gly Arg Trp 
865 870 875 880 

Pro Trp Gln Cys Ser Leu Gln Ser Glu Pro Ser Gly His Ile Cys Gly 
885 890 895 

Cys Val Leu Ile Ala Lys Lys Trp Val Leu Thr Val Ala His Cys Phe 
900 905 910 

Glu Gly Arg Glu Asp Ala Asp Val Trp Lys Val Val Phe Gly Ile Asn 
915 920 925 

Asn Leu Asp His Pro Ser Gly Phe Met Gln Thr Arg Phe Val Lys Thr 
930 935 940 

Ile Leu Leu His Pro Arg Tyr Ser Arg Ala Val Val Asp Tyr Asp Ile 
945 950 955 960 

Ser Val Val Glu Leu Ser Asp Asp Ile Asn Glu Thr Ser Tyr Val Arg 
965 970 975 

Pro Val Cys Leu Pro Ser Pro Glu Glu Tyr Leu Glu Pro Asp Thr Tyr 
980 985 990 

Cys Tyr Ile Thr Gly Trp Gly His Met Gly Asn Lys Met Pro Phe Lys 
995 1000 1005 

Leu Gln Glu Gly Glu Val Arg Ile Ile Pro Leu Glu Gln Cys Gln Ser 
1010 1015 1020 

Tyr Phe Asp Met Lys Thr Ile Thr Asn Arg Met Ile Cys Ala Gly Tyr 
1025 1030 1035 1040 

Glu Ser Gly Thr Val Asp Ser Cys Met Gly Asp Ser Gly Gly Pro Leu 
1045 1050 1055 

Val Cys Glu Arg Pro Gly Gly Gln Trp Thr Leu Phe Gly Leu Thr Ser 
1060 1065 1070 

Trp Gly Ser Val Cys Phe Ser Lys Val Leu Gly Pro Gly Val Tyr Ser 
1075 1080 1085 

Asn Val Ser Tyr Phe Val Gly Trp Ile Glu Arg Gln Ile Tyr Ile Gln 
1090 1095 1100 

Thr Phe Leu Gln Lys Lys Ser Gln Gly 
1105 1110 

<210> 5 

<211> 24 

<212> DNA 

<213> Human corin 

<400> 5 

cagttggttt gaacaagtgc aggg 24 

<210> 6 

<211> 24 

<212> DNA 

<213> Human corin 

<400> 6 

tgcaaggagg gatacgctcg cctg 24 

<210> 7 

<211> 24 

<212> DNA 

<213> Human corin 

<400> 7 

aatcccaaga acagactcac agcg 24 

<210> 8 

<211> 24 

<212> DNA 

<213> Human corin 

<400> 8 

cgggtcacag agagagctac cacc 24 

<210> 9 

<211> 24 

<212> DNA 

<213> Human corin 

<400> 9 

ggtctccttc ttgacatgaa tctg 24 

<210> 10 

<211> 30 

<212> DNA 

<213> Human corin 

<400> 10 

aacaaaacga tccttggagg tcggacgagt 30 

<210> 11 

<211> 24 

<212> DNA 

<213> Mouse corin 

<400> 11 

tcttctgtgt actaaacaag actg 24 

<210> 12 

<211> 24 

<212> DNA 

<213> Mouse corin 

<400> 12 

aggccccagg actttggaaa agca 24 

<210> 13 

<211> 24 

<212> DNA 

<213> Mouse corin 

<400> 13 

acagtggcct gaagacacag attg 24 

<210> 14 

<211> 24 

<212> DNA 

<213> Human corin 

<400> 14 

acagagcatc gctgcgggga cggg 24 

<210> 15 

<211> 30 

<212> DNA 

<213> Human corin 

<400> 15 

aacaaaagga tccttggagg tcggacgagt 30 

<210> 16 

<211> 23 

<212> DNA 

<213> Human corin 

<400> 16 

cggagcccca tgaagttaat cca 23 

<210> 17 

<211> 24 

<212> DNA 

<213> Human GAPDH 

<400> 17 

gtcaatgtcc caaacgtcac caga 24 

<210> 18 

<211> 29 

<212> DNA 

<213> Human GAPDH 

<400> 18 

atttcgggaa tgctgagaaa acagacaga 29 